International preliminary examination fee paid to USPTO (37 CFR 1.492(a)(1)) $690.00

No international preliminary examination fee paid to USPTO (37 CFR 1.492(a)(1)) but international search fee paid to USPTO (37 CFR 1.492(a)(2)) $710.00

nor international search fee (37 CFR 1.492(a)(2)) paid to USPTO $1,000.00

(a)(4)) and all claims satisfied provisions of PCT Article 33(1)-(4) $100.00

Multiple dependent claim(s) (if applicable)



TOTAL OF ABOVE CALCULATIONS = 



[ ] Applicant claims small entity status. See 37 CFR 1.27. The fees indicated above are
reduced by 1/2.

months from the earliest claimed priority date (37 CFR 1.492(f)).



TOTAL NATIONAL FEE = 



Fee for recording the enclosed assignment (37 CFR 1.21(h)). The assignment must be
accompanied by an appropriate cover sheet (37 CFR 3.28, 3.31). $40.00 per property +



TOTAL FEES ENCLOSED = 
a. [x] A check in the amount of $2822.00 to cover the above fees is enclosed.

in the amount of $ to cover the above fees. A duplicate copy of this sheet is enclosed.

c. [x] The Commissioner is hereby authorized to charge any additional fees which may be required, or credit any overpayment
to Deposit Account No. 06-1205. A duplicate copy of this sheet is enclosed.

d. [ ] Fees are to be charged to a credit card. WARNING: Information on this form may become public. Credit card
information should not be included on this form. Provide credit card information and authorization on PTO-2038.

NOTE: Where an appropriate time limit under 37 CFR 1.494 or 1.495 has not been met, a petition to revive (37 CFR
1.137(a) or (b)) must be filed and granted to restore the application to pending status.

HISASHI NARIMATSU, ET AL.

Application No.: (National Phase of 
PCT/JP00/01070) 
For: NOVEL POLYPEPTIDES

Assistant Commissioner for Patents 
Washington, D.C. 20231 



Examiner: Not Yet Assigned 
Group Art Unit: N/Y/A 



August 22, 2001 



PRELIMINARY AMENDMENT

Sir:

Prior to action on the merits, please amend the above-identified application
as follows:



IN THE SPECIFICATION 

Please substitute the paragraph at page 4, lines 8-13 with the following 
replacement paragraph. A marked-up copy of this paragraph, showing the changes made 
thereto, is attached. 

The object of the present invention is to provide a pharmaceutical 

preparation for anti-inflammations, anti-infections or inhibition of cancer metastasis, foods 

such as dairy products, a method of improving proteins, and a method for diagnosis of 



diseases such as cancers, by utilizing a novel polypeptide having β1,3-galactosyltransferase
activity. 

Please substitute the paragraph starting at page 10, line 36 and ending at
page 11, line 2 with the following replacement paragraph. A marked-up copy of this
paragraph, showing the changes made thereto, is attached. 

25. A method for determining the expression level of a gene encoding the 
polypeptide of any one of items 1 to 4, which comprises hybridization using DNA coding 
for said polypeptide or a fragment of said DNA. 

Please substitute the paragraph at page 11, lines 10-27 with the following 
replacement paragraph. A marked-up copy of this paragraph, showing the changes made 
thereto, is attached. 

27. A DNA according to item 26 wherein the oligonucleotide derivative is 
selected from the group consisting of an oligonucleotide derivative in which the 
phosphodiester bond is converted into a phosphorothioate bond, an oligonucleotide 
derivative in which the phosphodiester bond is converted into an N3'-P5'-phosphoamidate 
bond, an oligonucleotide derivative in which the ribose and the phosphodiester bond are 
converted into a peptide-nucleic acid bond, an oligonucleotide derivative in which the 
uracil is replaced by a C-5 propynyluracil, an oligonucleotide derivative in which the uracil 
is replaced by a C-5 thiazolyluracil, an oligonucleotide derivative in which the cytosine is 
replaced by a C-5 propynylcytosine, an oligonucleotide derivative in which the cytosine is 
replaced by a phenoxazine-modified cytosine, an oligonucleotide derivative in which the 
ribose is replaced by a 2'-O-propylribose, and an oligonucleotide derivative in which the
ribose is replaced by a 2'-methoxyethoxyribose. 

Please substitute the paragraph at page 11, lines 30-33 with the following
replacement paragraph. A marked-up copy of this paragraph, showing the changes made 
thereto, is attached. 

29. A method for determining the expression level of a gene encoding the 

polypeptide of any one of items 1 to 4, which comprises polymerase chain reaction using 

the DNA of any one of items 26 to 28. 

Please substitute the paragraph at page 17, lines 30-33 with the following 
replacement paragraph. A marked-up copy of this paragraph, showing the changes made 
thereto, is attached. 

The expression vector into which the cDNA has been integrated is 

introduced into animal cells capable of expressing the objective cDNA, to give transformed 

cells. 

Please substitute the paragraph at page 22, lines 7-26 with the following 
replacement paragraph. A marked-up copy of this paragraph, showing the changes made 
thereto, is attached. 

The expression vector includes, e.g., pBTrp2, pBTac1, pBTac2 (which all
are commercially available from Boehringer Mannheim), pKK233-2 (a product of
Pharmacia), pSE280 (a product of Invitrogen), pGEMEX-1 (a product of Promega), pQE-8
(a product of QIAGEN), pKYP10 (Japanese Published Unexamined Patent Application
No. 110600/83), pKYP200 [Agric. Biol. Chem., 48, 669 (1984)], pLSA1 [Agric. Biol.
Chem., 53, 277 (1989)], pGEL1 [Proc. Natl. Acad. Sci., USA, 82, 4306 (1985)],
pBluescript II SK+ (a product of Stratagene), pBluescript II SK(-) (a product of
Stratagene), pTrs30 (FERM BP-5407), pTrs32 (FERM BP-5408), pGHA2 (FERM BP-400),
pGKA2 (FERM BP-6798), pTerm2 (Japanese Published Unexamined Patent
Application No. 22979/91, US 4686191, US 4939094, US 5160735), pEG400 [J.
Bacteriol., 172, 2392 (1990)], pGEX (a product of Pharmacia), pET system (a product of
Novagen), pSupex, pUB110, pTP5, pC194, pTrxFus (a product of Invitrogen), pMAL-c2
(a product of New England Biolabs), pUC19 [Gene, 33, 103 (1985)], pSTV28 (a product of
Takara Shuzo Co., Ltd.), pUC118 (a product of Takara Shuzo Co., Ltd.) and pPA1
(Japanese Published Unexamined Patent Application No. 233798/88).

Please substitute the paragraph at page 38, lines 9-23 with the following 
replacement paragraph. A marked-up copy of this paragraph, showing the changes made 
thereto, is attached. 

That is, N-acetylglucosamine monosaccharides, oligosaccharides having N- 

acetylglucosamine residue at the non-reducing termini thereof, or complex carbohydrates 

having N-acetylglucosamine residue at the non-reducing termini of sugar chains thereof, 

are used as the acceptor substrate while the polypeptide of the present invention obtained in 

the method described in item (2) above is used as the enzyme source, and reaction products 

having galactose transferred via β1,3-linkage to N-acetylglucosamine monosaccharide or

N-acetylglucosamine residue of the acceptor substrate can be produced by allowing said

receptor substrate, said enzyme source and uridine-5'-diphosphate galactose (UDP-Gal) to
be present in an aqueous medium to thereby form and accumulate said reaction products in
said aqueous medium and collecting said reaction products from said aqueous medium.

Please substitute the paragraph starting at page 44, line 36 and ending at 
page 45, line 7 with the following replacement paragraph. A marked-up copy of this 
paragraph, showing the changes made thereto, is attached. 

The class and subclass of the antibody are determined using a mouse 

monoclonal antibody typing kit or a rat monoclonal antibody typing kit. The class of the 

antibody means isotype of the antibody, and for example, mention can be made of IgG, 

IgA, IgM, IgD and IgE in human. The subclass of the antibody means isotype in the class,

and for example, mention can be made of IgG1, IgG2a, IgG2b and IgG3 in mouse, and

IgG1, IgG2, IgG3 and IgG4 in human.

Please substitute the paragraph starting at page 54, line 29 and ending at 
page 55, line 4 with the following replacement paragraph. A marked-up copy of this 
paragraph, showing the changes made thereto, is attached. 

Fig. 1, A shows the results of measurement of the expression levels of type 

1 sugar chains (sialyl-Lewis a sugar chain, Lewis a sugar chain, Lewis b sugar chain) in a 

wide variety of human cancer cell lines. Each kind of cell was stained with a fluorescent 

antibody using anti-sialyl-Lewis a sugar chain antibody (19-9), anti-Lewis a sugar chain 

antibody (7LE) or anti-Lewis b sugar chain antibody (TT42) and then analyzed by FACS.

The reactivity with each antibody is shown as +++, ++, +, ±, and - in the order of 

decreasing reactivity. - means the absence of reactivity with the antibody. NT means 

that the analysis was not conducted.

Please substitute the paragraphs at page 55, lines 15-32 with the following
replacement paragraph. A marked-up copy of this paragraph, showing the changes made 
thereto, is attached. 

Fig. 3, A shows the results of indirect fluorescent antibody staining with 
anti-sialyl-Lewis c sugar chain antibody (DU-PAN-2) and subsequent FACS analysis of 
Namalwa cells (Namalwa-mock) having the control plasmid (pAMo) introduced therein, or 
Namalwa cells (Namalwa-3GT5) having human β3Gal-T5 expression plasmid (pAMo-3GT5)
introduced therein. The shaded histogram shows the result of analysis using A-PBS
in place of DU-PAN-2. 

Fig. 3, B shows the results of indirect fluorescent antibody staining with anti-sialyl- 
Lewis a sugar chain antibody (19-9), anti-sialyl-Lewis c sugar chain antibody (DU-PAN-2), 
anti-Lewis a sugar chain antibody (7LE) or anti-Lewis b sugar chain antibody (TT42) and 
subsequent FACS analysis of HCT-15 cells (HCT-mock) having the control plasmid 
(pAMo) introduced therein, or HCT-15 cells (HCT-3GT5H) having human β3Gal-T5
expression plasmid (pAMo-3GT5) introduced therein. The shaded histogram shows the 
result of analysis using A-PBS in place of DU-PAN-2. 

Please substitute the paragraph at page 56, lines 22-29 with the following 
replacement paragraph. A marked-up copy of this paragraph, showing the changes made 
thereto, is attached. 

Fig. 6, C shows the results of examination of the expression level of each 

isoform of human β3Gal-T5 cDNA in Colo205 cells by the RT-PCR method. After RT-

PCR was conducted with the combination of primers shown in Fig. 6, A, the reaction 






product was cleaved with a restriction enzyme (XbaI or BsmI) shown in Fig. 6, to specify
the isoform. (-) means that the restriction enzyme treatment was not conducted. The left
line shows molecular markers (100 bp ladder). 

Please substitute the paragraph at page 58, lines 8-13 with the following 
replacement paragraph. A marked-up copy of this paragraph, showing the changes made 
thereto, is attached. 

The cells described above were cultured in a medium suitable for the 

respective cells, and then the cells were subjected to fluorescent antibody staining with 

anti-sialyl-Lewis a sugar chain antibody (19-9), anti-Lewis a sugar chain antibody (7LE) or 

anti-Lewis b sugar chain antibody (TT42) and analyzed by FACS. 

Please substitute the paragraph at page 60, lines 16-19 with the following 
replacement paragraph. A marked-up copy of this paragraph, showing the changes made 
thereto, is attached. 

As the internal controls, linear DNAs were prepared by cleaving plasmids 

prepared below (pUC119-3GT1d, pBS-3GT2d, pBS-3GT3d, pBS-3GT4d) with suitable

restriction enzymes to take the cDNA inserts. 

Please substitute the paragraph at page 60, lines 24-25 with the following 
replacement paragraph. A marked-up copy of this paragraph, showing the changes made 
thereto, is attached. 

By deleting a 183-bp sequence between StyI-StyI in human β3Gal-T3
cDNA in pBS-3GT3, pBS-3GT3d was prepared. 



Please substitute the paragraph at page 71, lines 5-10 with the following 
replacement paragraph. A marked-up copy of this paragraph, showing the changes made 
thereto, is attached. 

The single clone (HCT-3GT5H) thus obtained was subjected to indirect 

fluorescent antibody staining with anti-sialyl-Lewis a sugar chain antibody (19-9), anti- 

sialyl-Lewis c sugar chain antibody (DU-PAN-2, a product of Kyowa Medex), anti-Lewis a

sugar chain antibody (7LE) or anti-Lewis b sugar chain antibody (TT42). 



Please substitute the paragraph at page 73, lines 19-21 with the following 
replacement paragraph. A marked-up copy of this paragraph, showing the changes made 
thereto, is attached. 



The respective cells (1×10⁷ cells) were suspended in a solution [20 mmol/l
HEPES (pH 7.2), 2% Triton X-100] and sonicated in a short time to prepare a cell lysate. 

Please substitute the paragraph at page 76, lines 1-3 with the following
replacement paragraph. A marked-up copy of this paragraph, showing the changes made 
thereto, is attached. 

Said transformed cells were suspended in a solution [20 mmol/l HEPES (pH

7.2), 2% Triton X-100] and sonicated in a short time to prepare a cell lysate solution. 



Please substitute the paragraph at page 76, lines 14-19 with the following 
replacement paragraph. A marked-up copy of this paragraph, showing the changes made 
thereto, is attached. 



Specifically, the activity was measured by identifying a product by high 
performance liquid chromatography (HPLC) after reaction at 37°C for 2 hours in 10 µl
assay solution [14 mmol/l HEPES (pH 7.4), 75 µmol/l UDP-Gal (a product of SIGMA), 11
µmol/l MnCl2, 0.01% Triton X-100, 25 µmol/l pyridylaminated sugar chain substrate, and
the above cell lysate solution]. 

Please substitute the paragraphs at page 79, lines 12-27 with the following 
replacement paragraph. A marked-up copy of this paragraph, showing the changes made 
thereto, is attached. 

The β3Gal-T5 transcripts in various human tissues (brain, lung, esophagus,
stomach (body), stomach (antrum), jejunum, colon, liver, pancreas, spleen, kidney, adrenal,
uterus, peripheral blood lymphocytes) were quantified by RT-PCR in the same manner as
in item (7) in Example 2. The amount of the β3Gal-T5 gene transcript in each kind of
organs is shown as a value relative to the amount (= 1000) of the β-actin transcript (Fig. 5).

It was revealed that the β3Gal-T5 transcripts are significantly expressed in
the stomach (body), stomach (antrum), jejunum, colon and pancreas. Further, the β3Gal-T5
transcripts were slightly expressed in the brain, esophagus, kidney and uterus. On the
other hand, the β3Gal-T5 transcripts were not expressed in the lung, liver, spleen, adrenal
and peripheral blood lymphocytes.

REQUEST FOR APPROVAL OF DRAWING CHANGES:

The Examiner is requested to approve the following changes to the 

drawings: 
In Fig. 5, bottom left, please change "Peripheral lymphocytes" to
--Peripheral blood lymphocytes--. 

Enclosed is a Request for Approval of Proposed Drawing Changes. An 
indication of the Examiner's approval of the proposed changes is respectfully requested in 
the next Patent Office Communication. 

IN THE CLAIMS:

Please amend Claims 5, 13, 14, 16-19, 23, 25, 29-34, 37-40, 42-44 and 46 to 
read as follows. A marked-up copy of Claims 5, 13, 14, 16-19, 23, 25, 29-34, 37-40, 42-44 
and 46, showing the changes made thereto, is attached.

5. (Amended) A DNA selected from the group consisting of: 

(a) DNA coding for the polypeptide described in claims 1 or 2, 

(b) DNA having the nucleotide sequence of 402 to 1331 in the nucleotide
sequence represented by SEQ ID NO: 2, 

(c) DNA having the nucleotide sequence of 492 to 1331 in the nucleotide 
sequence represented by SEQ ID NO: 2, and 

(d) DNA hybridizing under stringent conditions with the DNA described in 
any of (a) to (c) and coding for a polypeptide having β1,3-galactosyltransferase activity
capable of synthesizing Galβ1-3GlcNAc structure.

13. (Amended) A process for producing the polypeptide described in
claims 1 or 2, which comprises culturing a transformant harboring a recombinant DNA 
prepared by integrating DNA coding for the polypeptide into a vector in a medium to 
thereby form and accumulate said polypeptide in the culture, and collecting said 
polypeptide from said culture. 

14. (Amended) A process for producing the polypeptide described in 
claims 1 or 2, which comprises breeding a non-human transgenic animal harboring a 
recombinant DNA prepared by integrating DNA coding for the polypeptide into a vector to 
thereby form and accumulate said polypeptide in said animal, and collecting. 

16. (Amended) A process for producing the polypeptide described in 
claims 1 or 2, which comprises culturing a transgenic plant harboring a recombinant DNA 
prepared by integrating DNA coding for the polypeptide into a vector to thereby form and 
accumulate said polypeptide in said plant, and collecting said polypeptide from said plant. 

17. (Amended) A process for producing the polypeptide described in 
claims 1 or 2, which comprises synthesizing the polypeptide in an in vitro transcription- 
translation system using DNA coding for said polypeptide. 



18. (Amended) A process for producing a reaction product having 
galactose, which comprises using a polypeptide having β1,3-galactosyltransferase activity
involved in the synthesis of sialyl-Lewis a sugar chain, present in colon cancer cells
expressing sialyl-Lewis a sugar chain, or a polypeptide which is selected from the group
consisting of:

(a) a polypeptide consisting of the amino acid sequence represented by SEQ ID NO: 1,

(b) a polypeptide containing the amino acid sequence of 31 to 310 in the
amino acid sequence represented by SEQ ID NO: 1, and

(c) a polypeptide consisting of an amino acid sequence where in the amino
acid sequence of the polypeptide (a) or (b), one or more amino acids have been deleted,
replaced or added and having β1,3-galactosyltransferase activity capable of synthesizing
Galβ1-3GlcNAc structure as an enzyme source, and allowing

(a) said enzyme source, 

(b) an acceptor substrate selected from the group consisting of: 

i) N-acetylglucosamine (GlcNAc), 

ii) an oligosaccharide having N-acetylglucosamine residue at the 
non-reducing terminus thereof, and 

iii) a complex carbohydrate having N-acetylglucosamine residue at 
the non-reducing terminus thereof, and 

(c) uridine-5'-diphosphate galactose to be present in an aqueous medium to
thereby form and accumulate said reaction product in the aqueous medium, and collecting
said reaction product from said aqueous medium, wherein the galactose is transferred via
β1,3-linkage to N-acetylglucosamine or N-acetylglucosamine residue of said acceptor
substrate. 

19. (Amended) A process for producing a reaction product having 
galactose, which comprises using a polypeptide having β1,3-galactosyltransferase activity
involved in the synthesis of sialyl-Lewis a sugar chain, present in colon cancer cells 
expressing sialyl-Lewis a sugar chain, or a polypeptide which is selected from the group 
consisting of: 

(a) a polypeptide consisting of the amino acid sequence represented by SEQ ID NO: 1,

(b) a polypeptide containing the amino acid sequence of 31 to 310 in the
amino acid sequence represented by SEQ ID NO: 1, and

(c) a polypeptide consisting of an amino acid sequence where in the amino
acid sequence of the polypeptide (a) or (b), one or more amino acids have been deleted,
replaced or added and having β1,3-galactosyltransferase activity capable of synthesizing
Galβ1-3GlcNAc structure as an enzyme source, and allowing

(a) said enzyme source, 

(b) an acceptor substrate selected from the group consisting of: 

i) glucose, 

ii) an oligosaccharide having glucose residue at the non-reducing
terminus thereof, and

iii) a complex carbohydrate having glucose residue at the non-reducing
terminus thereof, and

(c) uridine-5'-diphosphate galactose to be present in an aqueous medium to
thereby form and accumulate said reaction product in the aqueous medium, and collecting
said reaction product from said aqueous medium, wherein the galactose is transferred via
β1,3-linkage to glucose or glucose residue of said acceptor substrate.

23. (Amended) A process according to any of claims 18 to 20 wherein the 
complex carbohydrate is a complex carbohydrate selected from the group consisting of a 
glycoprotein, a glycolipid, a proteoglycan, a glycopeptide, a lipopolysaccharide, a 
peptidoglycan and a glycoside which is a steroid compound with a sugar chain. 

25. (Amended) A method for determining the expression level of a gene 
encoding the polypeptide of claims 1 or 2, which comprises hybridization using DNA 
coding for said polypeptide or a fragment of said DNA. 

29. (Amended) A method for determining the expression level of a gene 
encoding a polypeptide having β1,3-galactosyltransferase activity involved in the synthesis
of sialyl-Lewis a sugar chain, present in colon cancer cells expressing sialyl-Lewis a sugar 
chain, or a polypeptide which is selected from the group consisting of: 

(a) a polypeptide consisting of the amino acid sequence represented by SEQ ID NO: 1,

(b) a polypeptide containing the amino acid sequence of 31 to 310 in the
amino acid sequence represented by SEQ ID NO: 1, and

(c) a polypeptide consisting of an amino acid sequence where in the amino
acid sequence of the polypeptide (a) or (b), one or more amino acids have been deleted,
replaced or added and having β1,3-galactosyltransferase activity capable of synthesizing
Galβ1-3GlcNAc structure, which comprises polymerase chain reaction using the DNA of
claim 26. 

30. (Amended) A method for detecting cancers and cancer metastasis, 
which comprises using the method of claim 29. 

31. (Amended) A method for inhibiting transcription of DNA coding a 
polypeptide having β1,3-galactosyltransferase activity involved in the synthesis of sialyl-
Lewis a sugar chain, present in colon cancer cells expressing sialyl-Lewis a sugar chain, or 
a polypeptide which is selected from the group consisting of: 

(a) a polypeptide consisting of the amino acid sequence represented by SEQ ID NO: 1,

(b) a polypeptide containing the amino acid sequence of 31 to 310 in the
amino acid sequence represented by SEQ ID NO: 1, and

(c) a polypeptide consisting of an amino acid sequence where in the amino
acid sequence of the polypeptide (a) or (b), one or more amino acids have been deleted,
replaced or added and having β1,3-galactosyltransferase activity capable of synthesizing
Galβ1-3GlcNAc structure or translation of its corresponding mRNA, which comprises
using a DNA of claim 26 and/or a DNA having a nucleotide sequence represented by SEQ
ID NO: 2 or 3. 

32. (Amended) An antibody recognizing the polypeptide of claims 1 or 2. 

33. (Amended) A method for immunological detection of a polypeptide 
having β1,3-galactosyltransferase activity involved in the synthesis of sialyl-Lewis a sugar
chain, present in colon cancer cells expressing sialyl-Lewis a sugar chain, or a polypeptide 
which is selected from the group consisting of: 

(a) a polypeptide consisting of the amino acid sequence represented by SEQ ID NO: 1,

(b) a polypeptide containing the amino acid sequence of 31 to 310 in the
amino acid sequence represented by SEQ ID NO: 1, and

(c) a polypeptide consisting of an amino acid sequence where in the amino
acid sequence of the polypeptide (a) or (b), one or more amino acids have been deleted,
replaced or added and having β1,3-galactosyltransferase activity capable of synthesizing
Galβ1-3GlcNAc structure, which comprises using the antibody of claim 32.

34. (Amended) An immunohisto staining method, which comprises 
detecting a polypeptide of having β1,3-galactosyltransferase activity involved in the
synthesis of sialyl-Lewis a sugar chain, present in colon cancer cells expressing sialyl-
Lewis a sugar chain, or a polypeptide which is selected from the group consisting of:

(a) a polypeptide consisting of the amino acid sequence represented by SEQ ID NO: 1,

(b) a polypeptide containing the amino acid sequence of 31 to 310 in the
amino acid sequence represented by SEQ ID NO: 1, and

(c) a polypeptide consisting of an amino acid sequence where in the amino
acid sequence of the polypeptide (a) or (b), one or more amino acids have been deleted,
replaced or added and having β1,3-galactosyltransferase activity capable of synthesizing
Galβ1-3GlcNAc structure by using the antibody of claim 32.

37. (Amended) A method for screening a compound varying the activity of 
the polypeptide of claims 1 or 2, which comprises contacting said polypeptide with a test 
sample. 

38. (Amended) A method for screening a compound varying the 
expression of a gene coding for the polypeptide of claims 1 or 2, which comprises 
contacting cells expressing said polypeptide with a test sample and determining the content 
of sialyl-Lewis a sugar chain, Lewis a sugar chain, Lewis b sugar chain or sialyl-Lewis c 
sugar chain by use of anti-sialyl-Lewis a antibody, anti-Lewis a antibody, anti-Lewis b 
antibody or anti-sialyl-Lewis c antibody.

39. (Amended) A method for screening a compound varying the
expression of a gene coding for a polypeptide having β1,3-galactosyltransferase activity
involved in the synthesis of sialyl-Lewis a sugar chain, present in colon cancer cells 
expressing sialyl-Lewis a sugar chain, or a polypeptide which is selected from the group 
consisting of: 

(a) a polypeptide consisting of the amino acid sequence represented by SEQ ID NO: 1,

(b) a polypeptide containing the amino acid sequence of 31 to 310 in the
amino acid sequence represented by SEQ ID NO: 1, and

(c) a polypeptide consisting of an amino acid sequence where in the amino
acid sequence of the polypeptide (a) or (b), one or more amino acids have been deleted,
replaced or added and having β1,3-galactosyltransferase activity capable of synthesizing
Galβ1-3GlcNAc structure, which comprises contacting cells expressing said polypeptide
with a test sample and determining the content of said polypeptide by use of the antibody 
of claim 32. 

40. (Amended) A promoter DNA governing transcription of a gene coding 
for a polypeptide having β1,3-galactosyltransferase activity involved in the synthesis of
sialyl-Lewis a sugar chain, present in colon cancer cells expressing sialyl-Lewis a sugar 
chain, or a polypeptide which is selected from the group consisting of: 

(a) a polypeptide consisting of the amino acid sequence represented by SEQ ID NO: 1,

(b) a polypeptide containing the amino acid sequence of 31 to 310 in the
amino acid sequence represented by SEQ ID NO: 1, and

(c) a polypeptide consisting of an amino acid sequence where in the amino
acid sequence of the polypeptide (a) or (b), one or more amino acids have been deleted,
replaced or added and having β1,3-galactosyltransferase activity capable of synthesizing
Galβ1-3GlcNAc structure.

42. (Amended) A promoter DNA according to claim 41, which is a human- 
or mouse-derived promoter DNA. 

43. (Amended) A promoter DNA according to claims 40 or 41, which 
comprises a 50- to 5000-bp consecutive nucleotide DNA sequence in the nucleotide 
sequence of 1 to 5000 in the nucleotide sequence represented by SEQ ID NO: 3. 

44. (Amended) A method for screening a compound varying the efficiency 
of transcription by the promoter DNA of claims 40 or 41, which comprises transforming 
animal cells with a plasmid containing said promoter DNA and a reporter gene ligated 
downstream of said promoter DNA, then contacting the transformant with a test sample, 
and determining the content of a translation product of said reporter gene. 

46. (Amended) A knockout non-human animal wherein a DNA coding for 
the polypeptide of claims 1 or 2 is rendered defective or mutated.

REMARKS

Claims 5, 13, 14, 16-19, 23, 25, 29-34, 37-40, 42-44 and 46 have been 
amended to correct their dependency and conformity with accepted U.S. practice. 



No new matter has been added. 
Entry hereof is earnestly solicited. 

Applicants' undersigned attorney may be reached in our New York office by
telephone at (212) 218-2100. All correspondence should continue to be directed to our
below listed address.



FITZPATRICK, CELLA, HARPER & SCINTO 
30 Rockefeller Plaza 
New York, New York 10112-3801 
Facsimile: (212) 218-2200

LSP\ac 



NY MAIN 194328 v1




Respectfully submitted, 



Attorney for Applicants 
Registration No. 31,865

VERSION WITH MARKINGS TO SHOW CHANGES MADE TO CLAIMS

5. (Amended) A DNA selected from the group consisting of: 

(a) DNA coding for the polypeptide described in [any one of] claims 1 [to 4] or 2,

(b) DNA having the nucleotide sequence of 402 to 1331 in the nucleotide 
sequence represented by SEQ ID NO: 2, 

(c) DNA having the nucleotide sequence of 492 to 1331 in the nucleotide
sequence represented by SEQ ID NO: 2, and 

(d) DNA hybridizing under stringent conditions with the DNA described in 
any of (a) to (c) and coding for a polypeptide having β1,3-galactosyltransferase activity
capable of synthesizing Galβ1-3GlcNAc structure.

13. (Amended) A process for producing the polypeptide described in [any 
one of] claims 1 [to 4] or 2, which comprises culturing a transformant harboring a
recombinant DNA prepared by integrating DNA coding for the polypeptide [of any one of 
claims 1 to 4] into a vector in a medium to thereby form and accumulate said polypeptide 
in the culture, and collecting said polypeptide from said culture.

14. (Amended) A process for producing the polypeptide described in [any
one of] claims 1 [to 4] or 2, which comprises breeding a non-human transgenic animal
harboring a recombinant DNA prepared by integrating DNA coding for the polypeptide [of 
any one of claims 1 to 4] into a vector to thereby form and accumulate said polypeptide in 
said animal, and collecting. 

16. (Amended) A process for producing the polypeptide described in [any 
one of] claims 1 [to 4] or 2, which comprises culturing a transgenic plant harboring a
recombinant DNA prepared by integrating DNA coding for the polypeptide [of any one of 
claims 1 to 4] into a vector to thereby form and accumulate said polypeptide in said plant, 
and collecting said polypeptide from said plant. 

17. (Amended) A process for producing the polypeptide described in [any 
one of] claims 1 [to 4] or 2, which comprises synthesizing the polypeptide [of any one of
claims 1 to 4] in an in vitro transcription-translation system using DNA coding for said 
polypeptide. 

18. (Amended) A process for producing a reaction product having
galactose, which comprises using [the] a polypeptide having β1,3-galactosyltransferase
activity involved in the synthesis of sialyl-Lewis a sugar chain, present in colon cancer
cells expressing sialyl-Lewis a sugar chain, or a polypeptide which is selected from the
group consisting of:

(a) a polypeptide consisting of the amino acid sequence represented by SEQ ID NO: 1,

(b) a polypeptide containing the amino acid sequence of 31 to 310 in the
amino acid sequence represented by SEQ ID NO: 1, and

(c) a polypeptide consisting of an amino acid sequence where in the amino
acid sequence of the polypeptide (a) or (b), one or more amino acids have been deleted,
replaced or added and having β1,3-galactosyltransferase activity capable of synthesizing
Galβ1-3GlcNAc structure [of any one of claims 1 to 4] as an enzyme source, and allowing

(a) said enzyme source, 

(b) an acceptor substrate selected from the group consisting of: 

i) N-acetylglucosamine (GlcNAc), 

ii) an oligosaccharide having N-acetylglucosamine residue at the 
non-reducing terminus thereof, and 

iii) a complex carbohydrate having N-acetylglucosamine residue at 
the non-reducing terminus thereof, and 

(c) uridine-5'-diphosphate galactose to be present in an aqueous medium to
thereby form and accumulate said reaction product in the aqueous medium, and collecting 
said reaction product from said aqueous medium, wherein the galactose is transferred via 

β1,3-linkage to N-acetylglucosamine or N-acetylglucosamine residue of said acceptor
substrate.



19. (Amended) A process for producing a reaction product having
galactose, which comprises using [the] a polypeptide [of any of claims 1 to 4] having
β1,3-galactosyltransferase activity involved in the synthesis of sialyl-Lewis a sugar chain,
present in colon cancer cells expressing sialyl-Lewis a sugar chain, or a polypeptide which
is selected from the group consisting of:

(a) a polypeptide consisting of the amino acid sequence represented by SEQ ID NO: 1,

(b) a polypeptide containing the amino acid sequence of 31 to 310 in the
amino acid sequence represented by SEQ ID NO: 1, and

(c) a polypeptide consisting of an amino acid sequence where in the amino
acid sequence of the polypeptide (a) or (b), one or more amino acids have been deleted,
replaced or added and having β1,3-galactosyltransferase activity capable of synthesizing
Galβ1-3GlcNAc structure as an enzyme source, and allowing

(a) said enzyme source, 

(b) an acceptor substrate selected from the group consisting of: 

i) glucose, 

ii) an oligosaccharide having glucose residue at the non-reducing
terminus thereof, and

iii) a complex carbohydrate having glucose residue at the
non-reducing terminus thereof, and

(c) uridine-5'-diphosphate galactose to be present in an aqueous medium to
thereby form and accumulate said reaction product in the aqueous medium, and collecting
said reaction product from said aqueous medium, wherein the galactose is transferred via
β1,3-linkage to glucose or glucose residue of said acceptor substrate.

23. (Amended) A process according to any [one] of claims 18 to 20 [22] 
wherein the complex carbohydrate is a complex carbohydrate selected from the group 
consisting of a glycoprotein, a glycolipid, a proteoglycan, a glycopeptide, a 
lipopolysaccharide, a peptidoglycan and a glycoside which is a steroid compound with a
sugar chain. 

25. (Amended) A method for determining the expression level of a gene
encoding the polypeptide of [any one of] claims 1 [to 4] or 2, which comprises
hybridization using DNA coding for said polypeptide or a fragment of said DNA. 

29. (Amended) A method for determining the expression level of a gene
encoding [the] a polypeptide [of any one of claims 1 to 4] having
β1,3-galactosyltransferase activity involved in the synthesis of sialyl-Lewis a sugar chain,
present in colon cancer cells expressing sialyl-Lewis a sugar chain, or a polypeptide which 
is selected from the group consisting of: 

(a) a polypeptide consisting of the amino acid sequence represented by SEQ ID NO: 1,

(b) a polypeptide containing the amino acid sequence of 31 to 310 in the
amino acid sequence represented by SEQ ID NO: 1, and

(c) a polypeptide consisting of an amino acid sequence where in the amino
acid sequence of the polypeptide (a) or (b), one or more amino acids have been deleted,
replaced or added and having β1,3-galactosyltransferase activity capable of synthesizing
Galβ1-3GlcNAc structure, which comprises polymerase chain reaction using the
[oligonucleotide] DNA of [any one of] claim[s] 26 [to 28].

30. (Amended) A method for detecting cancers and cancer metastasis, 
which comprises using the method of claim [25 or] 29. 

31. (Amended) A method for inhibiting transcription of DNA coding a [for
the] polypeptide [of any one of claims 1 to 4] having β1,3-galactosyltransferase activity
involved in the synthesis of sialyl-Lewis a sugar chain, present in colon cancer cells
expressing sialyl-Lewis a sugar chain, or a polypeptide which is selected from the group
consisting of:

(a) a polypeptide consisting of the amino acid sequence represented by SEQ ID NO: 1,

(b) a polypeptide containing the amino acid sequence of 31 to 310 in the
amino acid sequence represented by SEQ ID NO: 1, and

(c) a polypeptide consisting of an amino acid sequence where in the amino
acid sequence of the polypeptide (a) or (b), one or more amino acids have been deleted,
replaced or added and having β1,3-galactosyltransferase activity capable of synthesizing
Galβ1-3GlcNAc structure or translation of its corresponding mRNA, which comprises
using a DNA [selected from DNAs] of claim[s 5 and] 26 [to 28] and or a DNA[s] having a
nucleotide sequence represented by SEQ ID NO: 2 or 3.

32. (Amended) An antibody recognizing the polypeptide of [any one of] 
claims 1 [to 4] or 2.

33. (Amended) A method for immunological detection of [the] a
polypeptide [of any one of claims 1 to 4] having β1,3-galactosyltransferase activity
involved in the synthesis of sialyl-Lewis a sugar chain, present in colon cancer cells
expressing sialyl-Lewis a sugar chain, or a polypeptide which is selected from the group
consisting of:

(a) a polypeptide consisting of the amino acid sequence represented by SEQ ID NO: 1,

(b) a polypeptide containing the amino acid sequence of 31 to 310 in the
amino acid sequence represented by SEQ ID NO: 1, and

(c) a polypeptide consisting of an amino acid sequence where in the amino
acid sequence of the polypeptide (a) or (b), one or more amino acids have been deleted,
replaced or added and having β1,3-galactosyltransferase activity capable of synthesizing
Galβ1-3GlcNAc structure, which comprises using the antibody of claim 32.

34. (Amended) An immunohistostaining method, which comprises
detecting [the] a polypeptide of [any one of claims 1 to 4] having
β1,3-galactosyltransferase activity involved in the synthesis of sialyl-Lewis a sugar chain,
present in colon cancer cells expressing sialyl-Lewis a sugar chain, or a polypeptide which
is selected from the group consisting of:

(a) a polypeptide consisting of the amino acid sequence represented by SEQ ID NO: 1,

(b) a polypeptide containing the amino acid sequence of 31 to 310 in the
amino acid sequence represented by SEQ ID NO: 1, and

(c) a polypeptide consisting of an amino acid sequence where in the amino
acid sequence of the polypeptide (a) or (b), one or more amino acids have been deleted,
replaced or added and having β1,3-galactosyltransferase activity capable of synthesizing
Galβ1-3GlcNAc structure by using the antibody of claim 32.

37. (Amended) A method for screening a compound varying the activity of
the polypeptide of [any one of] claims 1 [to 4] or 2, which comprises contacting said
polypeptide with a test sample. 

38. (Amended) A method for screening a compound varying the 
expression of a gene coding for the polypeptide of [any one of] claims 1 [to 4] or 2, which
comprises contacting cells expressing said polypeptide with a test sample and determining 
the content of sialyl-Lewis a sugar chain, Lewis a sugar chain, Lewis b sugar chain or 
sialyl-Lewis c sugar chain by use of anti-sialyl-Lewis a antibody, anti-Lewis a antibody, 
anti-Lewis b antibody or anti-sialyl-Lewis c antibody. 

39. (Amended) A method for screening a compound varying the
expression of a gene coding for [the] a polypeptide [of any one of claims 1 to 4] having
β1,3-galactosyltransferase activity involved in the synthesis of sialyl-Lewis a sugar chain,
present in colon cancer cells expressing sialyl-Lewis a sugar chain, or a polypeptide which
is selected from the group consisting of:

(a) a polypeptide consisting of the amino acid sequence represented by SEQ ID NO: 1,

(b) a polypeptide containing the amino acid sequence of 31 to 310 in the
amino acid sequence represented by SEQ ID NO: 1, and

(c) a polypeptide consisting of an amino acid sequence where in the amino
acid sequence of the polypeptide (a) or (b), one or more amino acids have been deleted,
replaced or added and having β1,3-galactosyltransferase activity capable of synthesizing
Galβ1-3GlcNAc structure, which comprises contacting cells expressing said polypeptide
with a test sample and determining the content of said polypeptide by use of the antibody
of claim 32.

40. (Amended) A promoter DNA governing transcription of a gene coding
for [the] a polypeptide [described in any one of claims 1 to 4] having
β1,3-galactosyltransferase activity involved in the synthesis of sialyl-Lewis a sugar chain,
present in colon cancer cells expressing sialyl-Lewis a sugar chain, or a polypeptide which
is selected from the group consisting of:

(a) a polypeptide consisting of the amino acid sequence represented by SEQ ID NO: 1,

(b) a polypeptide containing the amino acid sequence of 31 to 310 in the
amino acid sequence represented by SEQ ID NO: 1, and

(c) a polypeptide consisting of an amino acid sequence where in the amino
acid sequence of the polypeptide (a) or (b), one or more amino acids have been deleted,
replaced or added and having β1,3-galactosyltransferase activity capable of synthesizing
Galβ1-3GlcNAc structure.

42. (Amended) A promoter DNA according to claim [40 or] 41, which is a
human- or mouse-derived promoter DNA. 

43. (Amended) A promoter DNA according to [any one of] claims 40 [to
42] or 41, which comprises a 50- to 5000-bp consecutive nucleotide DNA sequence in the
nucleotide sequence of 1 to 5000 in the nucleotide sequence represented by SEQ ID NO: 3. 

44. (Amended) A method for screening a compound varying the efficiency 
of transcription by the promoter DNA of [any one of] claims 40 [to 43] or 41, which
comprises transforming animal cells with a plasmid containing said promoter DNA and a 
reporter gene ligated downstream of said promoter DNA, then contacting the transformant 
with a test sample, and determining the content of a translation product of said reporter 
gene. 

46. (Amended) A knockout non-human animal wherein a DNA coding for 
the polypeptide of [any one of] claims 1 [to 4] or 2 is rendered defective or mutated.

VERSION WITH MARKINGS TO SHOW CHANGES MADE TO SPECIFICATION

The paragraph at page 4, lines 8-13 has been amended as follows:
The object of the present invention is to provide a pharmaceutical
preparation for anti-inflammations, anti-infections or inhibition of cancer metastasis, foods
such as dairy products, a method of improving proteins, and a method for diagnosis of
diseases such as cancers, by utilizing a novel polypeptide having β1,3-galactosyltransferase
activity.

The paragraph starting at page 10, line 36 and ending at page 11, line 2 has
been amended as follows:

25. A method for determining the expression level of a gene encoding the
polypeptide of any one of items 1 to 4, which comprises hybridization using DNA coding
for said polypeptide or a fragment of said DNA.

The paragraph at page 11, lines 10-27 has been amended as follows:
27. A[n oligonucleotide] DNA according to item 26 wherein the
oligonucleotide derivative is selected from the group consisting of an oligonucleotide
derivative in which the phosphodiester bond is converted into a phosphorothioate bond, an
oligonucleotide derivative in which the phosphodiester bond is converted into an N3'-P5'
phosphoamidate bond, an oligonucleotide derivative in which the ribose and the
phosphodiester bond are converted into a peptide-nucleic acid bond, an oligonucleotide 
derivative in which the uracil is replaced by a C-5 propynyluracil, an oligonucleotide 
derivative in which the uracil is replaced by a C-5 thiazolyluracil, an oligonucleotide 
derivative in which the cytosine is replaced by a C-5 propynylcytosine, an oligonucleotide 
derivative in which the cytosine is replaced by a phenoxazine-modified cytosine, an 
oligonucleotide derivative in which the ribose is replaced by a 2'-O-propylribose, and an
oligonucleotide derivative in which the ribose is replaced by a 2'-methoxyethoxyribose. 

The paragraph at page 11, lines 30-33 has been amended as follows:
29. A method for determining the expression level of a gene encoding the
polypeptide of any one of items 1 to 4, which comprises polymerase chain reaction using
the [oligonucleotide] DNA of any one of items 26 to 28.

The paragraph at page 17, lines 30-33 has been amended as follows:
The expression vector into which the cDNA has been integrated is
introduced into animal cells capable of [selecting] expressing the objective cDNA, to give
transformed cells.

The paragraph at page 22, lines 7-26 has been amended as follows:
The expression vector includes, e.g., pBTrp2, pBTac1, pBTac2 (which all
are commercially available from Boehringer Mannheim), pKK233-2 (a product of
Pharmacia), pSE280 (a product of Invitrogen), pGEMEX-1 (a product of Promega), pQE-8
(a product of QIAGEN), pKYP10 (Japanese Published Unexamined Patent Application
No. 110600/83), pKYP200 [Agric. Biol. Chem., 48, 669 (1984)], pLSA1 [Agric. Biol.
Chem., 53, 277 (1989)], pGEL1 [Proc. Natl. Acad. Sci., USA, 82, 4306 (1985)],
pBluescript II SK+ (a product of Stratagene), pBluescript II SK(-) (a product of
Stratagene), pTrs30 (FERM BP-5407), pTrs32 (FERM BP-5408), pGHA2 (FERM
BP-400), pGKA2 [(FERM B-6798)] (FERM BP-6798), pTerm2 (Japanese Published
Unexamined Patent Application No. 22979/91, US 4686191, US 4939094, US 5160735),
pEG400 [J. Bacteriol., 172, 2392 (1990)], pGEX (a product of Pharmacia), pET system (a
product of Novagen), pSupex, pUB110, pTP5, pC194, pTrxFus (a product of Invitrogen),
pMAL-c2 (a product of New England Biolabs), pUC19 [Gene, 33, 103 (1985)], pSTV28 (a
product of Takara Shuzo Co., Ltd.), pUC118 (a product of Takara Shuzo Co., Ltd.) and
pPA1 (Japanese Published Unexamined Patent Application No. 233798/88).

The paragraph at page 38, lines 9-23 has been amended as follows:
That is, N-acetylglucosamine monosaccharides, oligosaccharides having N- 
acetylglucosamine residue at the non-reducing termini thereof, or complex carbohydrates 
having N-acetylglucosamine residue at the non-reducing termini of sugar chains thereof, 
are used as the acceptor substrate while the polypeptide of the present invention obtained in
the method described in item (2) above is used as the enzyme source, and reaction products
having galactose transferred via β1,3-linkage to N-acetylglucosamine monosaccharide or
N-acetylglucosamine residue of the acceptor substrate can be produced by allowing said
receptor substrate, said enzyme source and uridine-5'-diphosphate galactose (UDP-Gal) to
be present in an aqueous medium to thereby form and accumulate said reaction products in 
said aqueous medium and collecting said reaction products from said aqueous medium. 

The paragraph starting at page 44, line 36 and ending at page 45, line 7 has 
been amended as follows. 

The class and subclass of the antibody [is] are determined using a mouse 
monoclonal antibody typing kit or a rat monoclonal antibody typing kit. The class of the 
antibody means isotype of the antibody, and for example, mention can be made of IgG, 
IgA, IgM, IgD and IgE in human. The subclass of the antibody means isotype in the class,
and for example, mention can be made of IgG1, IgG2a, IgG2b and IgG3 in mouse, and
IgG1, IgG2, IgG3 and IgG4 in human.

The paragraph starting at page 54, line 29 and ending at page 55, line 4 has 
been amended as follows. 

Fig. 1, A shows the results of measurement of the expression levels of type 
1 sugar chains (sialyl-Lewis a sugar chain, Lewis a sugar chain, Lewis b sugar chain) in a 
wide variety of human cancer cell lines. Each kind of cell was stained with a fluorescent
antibody using anti-sialyl-Lewis a sugar chain antibody (19-9), anti-Lewis a sugar chain 
antibody (7LE) or anti-Lewis b sugar chain antibody [(Neokokusai)] (TT42) and then 
analyzed by FACS. The reactivity with each antibody is shown as +++, ++, +, ±, and -, in 
the order of decreasing reactivity. - means the absence of reactivity with the antibody. 
NT means that the analysis was not conducted. 

The paragraphs at page 55, lines 15-32 have been amended as follows.

Fig. 3, A shows the results of indirect fluorescent antibody staining with 
anti-sialyl-Lewis c sugar chain antibody (DU-PAN-2) and subsequent FACS analysis of 
Namalwa cells [[Namalwa (mock)]] (Namalwa-mock) having the control plasmid (pAMo)
introduced therein, or Namalwa cells (Namalwa-3GT5) having human β3Gal-T5
expression plasmid (pAMo-3GT5) introduced therein. The shaded histogram shows the 
result of analysis using A-PBS in place of DU-PAN-2. 

Fig. 3, B shows the results of indirect fluorescent antibody staining with anti-sialyl- 
Lewis a sugar chain antibody (19-9), anti-sialyl-Lewis c sugar chain antibody (DU-PAN-2), 
anti-Lewis a sugar chain antibody (7LE) or anti-Lewis b sugar chain antibody 
([Neokokusai] TT42) and subsequent FACS analysis of HCT-15 cells [[HCT15 (mock)]]
(HCT-mock) having the control plasmid (pAMo) introduced therein, or HCT-15 cells
(HCT-3GT5H) having human β3Gal-T5 expression plasmid (pAMo-3GT5) introduced
therein. The shaded histogram shows the result of analysis using A-PBS in place of
DU-PAN-2.

The paragraph at page 56, lines 22-29 has been amended as follows:
Fig. 6, C shows the results of examination of the expression level of each
isoform of human β3Gal-T5 cDNA in Colo205 cells by the RT-PCR method. After
RT-PCR was conducted with the combination of primers shown in Fig. 6, A, the reaction
product was cleaved with a restriction enzyme (XbaI or BsmI) shown in Fig. 6, to specify
the isoform. ["none"] 0 means that the restriction enzyme treatment was not conducted. 
The left line shows molecular markers (100 bp ladder). 

The paragraph at page 58, lines 8-13 has been amended as follows:
The cells described above were cultured in a medium suitable for the 
respective cells, and then the cells were subjected to fluorescent antibody staining with 
anti-sialyl-Lewis a sugar chain antibody (19-9), anti-Lewis a sugar chain antibody (7LE) or 
anti-Lewis b sugar chain antibody ([Neokokusai] TT42) and analyzed by FACS. 

The paragraph at page 60, lines 16-19 has been amended as follows:
As the internal controls, linear DNAs were prepared by cleaving plasmids
prepared below ([pBS] pUC119-3GT1d, pBS-3GT2d, pBS-3GT3d, pBS-3GT4d) with
suitable restriction enzymes to take the cDNA inserts.

The paragraph at page 60, lines 24-25 has been amended as follows:
By deleting a 183-bp sequence between StyI-StyI in human β3Gal-T3
cDNA in pBS-3GT3, pBS-3GT3d was prepared.

The paragraph at page 71, lines 5-10 has been amended as follows:
The single clone (HCT-3GT5H) thus obtained was subjected to indirect 
fluorescent antibody staining with anti-sialyl-Lewis a sugar chain antibody (19-9), anti- 
sialyl-Lewis c sugar chain antibody (DU-PAN-2, a product of Kyowa Medex), anti-Lewis a 
sugar chain antibody (7LE) or anti-Lewis b sugar chain antibody ([Neokokusai] TT42). 

The paragraph at page 73, lines 19-21 has been amended as follows:
The respective cells (1 × 10^7 cells) were suspended in a solution [20 mmol/l
HEPES (pH 7.2), 2% [Triron] Triton X-100] and sonicated in a short time to prepare a
cell lysate.

The paragraph at page 76, lines 1-3 has been amended as follows:

Said transformed cells were suspended in a solution [20 mmol/l HEPES (pH
7.2), 2% [Triron] Triton X-100] and sonicated in a short time to prepare a cell lysate
solution.

The paragraph at page 76, lines 14-19 has been amended as follows:
Specifically, the activity was measured by identifying a product by high
performance liquid chromatography (HPLC) after reaction at 37°C for 2 hours in 10 µl
assay solution [14 mmol/l HEPES (pH 7.4), 75 µmol/l UDP-Gal (a product of SIGMA),
11 µmol/l MnCl2, 0.01% [Triron] Triton X-100, 25 µmol/l pyridylaminated sugar chain
substrate, and the above cell lysate solution].



The paragraphs at page 79, lines 12-27 have been amended as follows: 
The β3Gal-T5 transcripts in various human tissues (brain, lung, esophagus,
stomach (body), stomach (antrum), jejunum, colon, liver, pancreas, spleen, kidney, adrenal,
uterus, peripheral blood lymphocytes) were quantified by RT-PCR in the same manner as
in item (7) in Example 2. The amount of the β3Gal-T5 gene transcript in each kind of
organs is shown as a value relative to the amount (= 1000) of the β-actin transcript (Fig. 5).

It was revealed that the β3Gal-T5 transcripts are significantly expressed in
the stomach (body), stomach (antrum), jejunum, colon and pancreas. Further, the
β3Gal-T5 transcripts were slightly expressed in the brain, esophagus, kidney and uterus. On the
other hand, the β3Gal-T5 transcripts were not expressed in the lung, liver, spleen, adrenal
and peripheral blood lymphocytes.






09/914152
518 Rec'd PCT/PTO 24 AUG 2001



2139.25                                        PATENT APPLICATION

IN THE UNITED STATES PATENT AND TRADEMARK OFFICE

In re Application of:                )
                                     :   Examiner: Not Yet Assigned
HISASHI NARIMATSU, ET AL.            )
                                     :   Group Art Unit: N/Y/A
Application No.: (National Phase of  )
PCT/JP00/01070)                      :
                                     )
Filed: Currently herewith            :
                                     )
For: NOVEL POLYPEPTIDES              :   August 22, 2001

Assistant Commissioner for Patents 
Washington, D.C. 20231 

REQUEST FOR APPROVAL OF PROPOSED DRAWING CHANGES 

Sir: 

Applicants respectfully request the Examiner's approval of the following 
proposed changes to the drawings: 

In Fig. 5, bottom left, please change "Peripheral lymphocytes" to
--Peripheral blood lymphocytes--.

For the Examiner's convenience, the changes are illustrated in red on the 
accompanying sheet of drawings. 

Entry hereof and an indication that the proposed changes have been 
approved are respectfully requested in the next Patent and Trademark Office 
communication. 



Applicants' undersigned attorney may be reached in our New York office by 
telephone at (212) 218-2100. All correspondence should continue to be directed to our 
below listed address. 



Respectfully submitted, 




Lawrence S. Perry
Attorney for Applicants 
Registration No. 31,865 



FITZPATRICK, CELLA, HARPER & SCINTO 
30 Rockefeller Plaza 
New York, New York 10112-3801
Facsimile: (212) 218-2200

The present invention relates to a novel polypeptide
having β1,3-galactosyltransferase activity involved in the
synthesis of type 1 sugar chains such as sialyl-Lewis a sugar
chain in cancer cells in the digestive system such as colon
cancer cells, pancreatic cancer cells, etc. expressing
sialyl-Lewis a sugar chain, a process for producing said
polypeptide, a DNA coding for said polypeptide, a recombinant
vector having said DNA integrated therein, a transformant
carrying said recombinant vector, an antibody recognizing said
polypeptide, a process for producing type 1 sugar
chain-containing sugar chains and complex carbohydrates containing
said sugar chains by use of said polypeptide, and a process
for producing type 1 sugar chain-containing sugar chains and
complex carbohydrates containing said sugar chains by use of
a transformant carrying said recombinant vector.



Background Art 

It is estimated that sugar chains are involved not only
in life phenomena such as development, differentiation and cell
recognition but also in occurrence and progress of
inflammations, cancers, infections, auto-immune diseases and
a number of other diseases [A. Kobata, S. Hakomori and K. Nagai:
Glycobiology Series (1) to (6), Kodansha (1993), Glycobiology,
3, 97 (1993)].



Sugar chains exist not only as glycoproteins,
proteoglycans or glycolipids, in which they are added to
proteins or lipids, but also as oligosaccharides.

Sugar chains having Galβ1-3GlcNAc structure are called
type 1 sugar chains, and constitute core sugar chains of Lewis
blood type antigens and cancer-related sugar chain antigens.
The Lewis blood type antigens include not only Lewis a sugar
chain [Galβ1-3(Fucα1-4)GlcNAc] but also Lewis b sugar chain
[Fucα1-2Galβ1-3(Fucα1-4)GlcNAc]. Sialyl-Lewis a sugar chain
[NeuAcα2-3Galβ1-3(Fucα1-4)GlcNAc] and sialyl-Lewis c sugar
chain (NeuAcα2-3Galβ1-3GlcNAc) are cancer-related sugar
chains detected highly frequently in cancers in mainly the
digestive system such as colon cancers, pancreatic cancers,
etc., and antibodies against sialyl-Lewis a sugar chain and
sialyl-Lewis c sugar chain are utilized for serodiagnosis of
cancers.

It is revealed that adhesion of adhesion molecule
selectins (E-, P- and L-selectins) and their sugar chain
ligands (sialyl-Lewis x sugar chains or their related sugar
chains) is involved in accumulation of leukocytes in
inflammatory sites and homing of lymphocytes to lymph nodes.
Because sialyl-Lewis a sugar chain, i.e. structural
isomer of sialyl-Lewis x sugar chain
[NeuAcα2-3Galβ1-4(Fucα1-3)GlcNAc] binds to selectins, sialyl-Lewis a sugar
chain is considered to participate in cancer metastasis.
Further, it is reported that the expression level of
sialyl-Lewis a sugar chain in colon cancers and pancreatic
cancers is correlated with poor prognosis of cancers.

The Galβ1-3GlcNAc structure is synthesized by a GlcNAc
β1,3-galactosyltransferase. To date, genes of three GlcNAc
β1,3-galactosyltransferases (β3Gal-T1, β3Gal-T2, β3Gal-T3)
have been cloned, and the acceptor substrate specificity of
each enzyme has been analyzed [Japanese Published Unexamined
Patent Application No. 181759/94, J. Biol. Chem., 273,
58-65 (1998), J. Biol. Chem., 273, 433-440 (1998), J. Biol. Chem.,
273, 12770-12778 (1998)]. Further, another
β1,3-galactosyltransferase (β3Gal-T4) having different substrate
specificity has been cloned [J. Biol. Chem., 272, 24794-24799
(1997), J. Biol. Chem., 273, 12770-12778 (1998)]. β3Gal-T4
synthesizes ganglioside GA1, GM1 or GD1b, but not the
Galβ1-3GlcNAc structure.

If GlcNAc β1,3-galactosyltransferase involved in the
synthesis of sialyl-Lewis a sugar chain and sialyl-Lewis c
sugar chain as cancer-related sugar chains can be identified
in cancers in the digestive system such as colon cancers,
pancreatic cancers, etc., more accurate diagnosis of cancers
would be feasible by examining said enzyme or the expression
level of a gene of said enzyme. Further, it is expectable that
cancer metastasis could be inhibited by regulating the activity
of said enzyme or the transcription and translation of said
enzyme gene. However, neither said enzyme nor said enzyme gene
has been identified. A GlcNAc β1,3-galactosyltransferase has
been partially purified from colon cancer cell line Colo205,
but nobody has achieved isolation of said enzyme, determination
of the amino acid sequence of said enzyme, or isolation of a
gene of said enzyme [J. Biol. Chem., 262, 15649-15658 (1987),
Arch. Biochem. Biophys. 270, 630-646 (1989), Arch. Biochem.
Biophys. 274, 14-25 (1989)].

A sugar chain having an ability to bind to selectins is
useful as a selectin antagonist to treat and prevent
inflammations and cancer metastasis. Accordingly, it is
estimated that β1,3-galactosyltransferases involved in the
synthesis of sialyl-Lewis a sugar chains in cancers in the
digestive system such as colon cancers and pancreatic cancers
are also applicable to efficient synthesis of selectin
antagonists.

It is known that various oligosaccharides occur in human
milk [Acta Paediatrica, 82, 903 (1993)]. Lacto-N-tetraose
(Galβ1-3GlcNAcβ1-3Galβ1-4Glc) is contained in human milk
and estimated to prevent infants from being infected with
viruses or microorganisms. Further, lacto-N-tetraose has the
activity of promoting the growth of bifidobacterium as benign
enteric bacterium. On the other hand, there are few types of
oligosaccharides occurring in milk from animals such as cattle,
mice, etc., and a majority thereof are lactose, and
trisaccharides or higher oligosaccharides scarcely occur
[Acta Paediatrica, 82, 903 (1993), J. Biol. Chem., 270, 29515
(1995)].

It would be considered significantly advantageous in
industry if we could produce efficiently various
oligosaccharides having lacto-N-tetraose as a backbone. It
is therefore an industrially important task to develop an
enzyme having a higher activity of synthesizing
lacto-N-tetraose than the activity of GlcNAc
β1,3-galactosyltransferase so far cloned.

Disclosure of Invention 

The object of the present invention is to provide a
pharmaceutical preparation for anti-inflammations,
anti-infections or inhibition of cancer metastasis, foods such as
dairy products, a method of improving proteins, and a method
for diagnosis of diseases such as cancers, by utilizing a novel
polypeptide having β1,3-galactosyltransferase.
The present invention relates to:

1. A polypeptide having β1,3-galactosyltransferase
activity involved in the synthesis of sialyl-Lewis a sugar
chain, present in colon cancer cells expressing sialyl-Lewis
a sugar chain.

The novel polypeptide of the present invention having
β1,3-galactosyltransferase activity occurs not only in colon
cancer cells expressing sialyl-Lewis a sugar chain, but also
in cancer cells in the digestive system, such as colon cancer
cells, pancreatic cancer cells, etc. expressing type 1 sugar
chains such as sialyl-Lewis c sugar chain, Lewis a sugar chain,
Lewis b sugar chain, etc. A polypeptide having
β1,3-galactosyltransferase activity involved in efficient
synthesis of type 1 sugar chains present in these cancer cells
in the digestive system is also the polypeptide of the present
invention. The polypeptide of the present invention is a novel
β1,3-galactosyltransferase different from known β3Gal-T1,
β3Gal-T2 and β3Gal-T3.

2. A polypeptide selected from the group consisting of:
(a) a polypeptide consisting of the amino acid sequence
represented by SEQ ID NO: 1,
(b) a polypeptide containing the amino acid sequence of
positions 31 to 310 in the amino acid sequence represented by
SEQ ID NO: 1, and
(c) a polypeptide consisting of an amino acid sequence
where in the amino acid sequence of the polypeptide (a) or (b),
one or more amino acids have been deleted, replaced or added
and having β1,3-galactosyltransferase activity capable of
synthesizing the Galβ1-3GlcNAc structure.

The polypeptide consisting of an amino acid sequence
where in the amino acid sequence of the polypeptide (a) or (b),
one or more amino acids have been deleted, replaced or added
and having the β1,3-galactosyltransferase activity of the
polypeptide (a) or (b) capable of synthesizing the Galβ1-3GlcNAc
structure can be obtained by, e.g., site-directed mutagenesis
of DNA coding for a polypeptide having, e.g., the amino acid
sequence represented by SEQ ID NO: 1, using the site-directed
mutagenesis methods described in Molecular Cloning, A
Laboratory Manual, Second Edition, Cold Spring Harbor
Laboratory Press (1989) (abbreviated hereinafter to Molecular
Cloning, 2nd edition), Current Protocols in Molecular Biology,
John Wiley & Sons (1987-1997) (abbreviated hereinafter to
Current Protocols in Molecular Biology), Nucleic Acids
Research, 10, 6487 (1982), Proc. Natl. Acad. Sci., USA, 79,
6409 (1982), Gene, 34, 315 (1985), Nucleic Acids Research, 13,
4431 (1985), Proc. Natl. Acad. Sci., USA, 82, 488 (1985), etc.

The number of amino acids deleted, replaced or added,
which is the number of amino acids capable of being deleted,
replaced or added by known methods such as the site-directed
mutagenesis described above, is not particularly limited,
but it is preferably 1 to several tens, more preferably 1 to 20,
still more preferably 1 to 10 and most preferably 1 to 5.

To achieve the β1,3-galactosyltransferase activity
capable of synthesizing the Galβ1-3GlcNAc structure, the homology
of the polypeptide of the present invention with the amino acid
sequence represented by SEQ ID NO: 1, as calculated using BLAST
[J. Mol. Biol., 215, 403 (1990)] or FASTA [Methods in Enzymology,
183, 63-98 (1990)] (the calculation means, method, etc. are
described for defining the % homology), is 60% or more,
preferably 80% or more, and more preferably 95% or more.

The polypeptide consisting of an amino acid sequence
where in the amino acid sequence of the polypeptide (a) or (b),
one or more amino acids have been deleted, replaced or added
and having β1,3-galactosyltransferase activity capable of
synthesizing the Galβ1-3GlcNAc structure is a novel
β1,3-galactosyltransferase different from known β3Gal-T1, β3Gal-T2
and β3Gal-T3.

3. A polypeptide according to item 1 or 2 wherein the
β1,3-galactosyltransferase activity is the activity of
transferring galactose via β1,3-linkage to an
N-acetylglucosamine residue present at the non-reducing
terminus of a sugar chain.

4. A polypeptide according to item 1 or 2 wherein the
β1,3-galactosyltransferase activity is the activity of
transferring galactose via β1,3-linkage to an
N-acetylglucosamine residue present at the non-reducing
terminus of GlcNAcβ1-3Galβ1-4Glc or to N-acetylglucosamine
monosaccharide.

5. A DNA selected from the group consisting of:
(a) DNA coding for the polypeptide described in any one
of items 1 to 4,
(b) DNA having the nucleotide sequence of 402 to 1331
in the nucleotide sequence represented by SEQ ID NO: 2,
(c) DNA having the nucleotide sequence of 492 to 1331
in the nucleotide sequence represented by SEQ ID NO: 2, and
(d) DNA hybridizing under stringent conditions with the
DNA described in any of (a) to (c) and coding for a polypeptide
having β1,3-galactosyltransferase activity capable of
synthesizing the Galβ1-3GlcNAc structure.
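
The coordinates given in items 2 and 5 can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming (as the numbers suggest, although the text does not state it outright) that nucleotide 402 of SEQ ID NO: 2 is the first base of codon 1 and that all ranges are 1-based and inclusive. The function names are illustrative only.

```python
def codon_count(first_base: int, last_base: int) -> int:
    """Return the number of whole codons in a 1-based, inclusive nucleotide range."""
    length = last_base - first_base + 1
    if length <= 0 or length % 3 != 0:
        raise ValueError("range is empty or not a whole number of codons")
    return length // 3


def first_base_of_residue(orf_start: int, residue: int) -> int:
    """Return the position of the first base of the codon for a 1-based residue."""
    if residue < 1:
        raise ValueError("residue index must be 1 or greater")
    return orf_start + 3 * (residue - 1)


if __name__ == "__main__":
    ORF_START = 402  # assumption: first base of codon 1 in SEQ ID NO: 2
    print(codon_count(402, 1331))                # codons in range (b)
    print(codon_count(492, 1331))                # codons in range (c)
    print(first_base_of_residue(ORF_START, 31))  # start of the codon for residue 31
```

Under these assumptions, the range 402 to 1331 spans 310 codons and the range 492 to 1331 spans 280 codons beginning at the codon for residue 31, which agrees with positions 31 to 310 of SEQ ID NO: 1 in item 2(b).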

The "DNA capable of hybridizing under stringent
conditions" described above refers to a DNA obtained by colony
hybridization, plaque hybridization, Southern blot
hybridization, or the like, with a DNA selected as the probe
from the group consisting of DNAs in items (a), (b) and (c)
described above. A specific example is a DNA which can
be identified by carrying out hybridization at 65°C in the
presence of 0.7 to 1.0 mol/l NaCl using a filter on which a
DNA prepared from colonies or plaques is immobilized and then
washing the filter at 65°C with a 0.1- to 2-fold conc. SSC
(saline-sodium citrate) solution (1-fold conc. SSC solution
is composed of 150 mmol/l sodium chloride and 15 mmol/l sodium
citrate).
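
The wash stringency above is expressed as a multiple of the 1-fold SSC composition, so the salt concentrations at any dilution follow by scaling. The sketch below only illustrates that arithmetic; the dictionary and function names are hypothetical, and the sole input taken from the text is the 1-fold composition (150 mmol/l sodium chloride, 15 mmol/l sodium citrate).

```python
# 1-fold SSC composition as stated in the text, in mmol/l.
SSC_1X_MMOL_PER_L = {"sodium chloride": 150.0, "sodium citrate": 15.0}


def ssc_composition(fold: float) -> dict:
    """Return the mmol/l concentration of each salt in an n-fold SSC solution."""
    if fold <= 0:
        raise ValueError("fold concentration must be positive")
    return {salt: conc * fold for salt, conc in SSC_1X_MMOL_PER_L.items()}


def is_within_stated_wash_range(fold: float) -> bool:
    """Check that a wash solution lies in the 0.1- to 2-fold range given above."""
    return 0.1 <= fold <= 2.0


if __name__ == "__main__":
    for fold in (0.5, 1.0, 2.0):
        print(fold, ssc_composition(fold), is_within_stated_wash_range(fold))
```

Lower fold values mean less salt and therefore a more stringent wash, so the 0.1-fold end of the stated range is the most stringent condition.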

Hybridization can be effected according to a method
described in books on experiments such as Molecular Cloning,
Second Edition; Current Protocols in Molecular Biology,
Supplements 1 to 38; and DNA Cloning 1: Core Techniques, A Practical
Approach, Second Edition, Oxford University Press (1995).

Specifically, the DNA capable of hybridizing, when
calculated using BLAST [J. Mol. Biol., 215, 403 (1990)] or FASTA
[Methods in Enzymology, 183, 63-98 (1990)], includes DNA having
60% or more homology, preferably 80% or more homology, more
preferably 95% or more homology with a nucleotide sequence
coding for a polypeptide having the amino acid sequence
represented by SEQ ID NO: 1. The polypeptide encoded by said
DNA and having β1,3-galactosyltransferase activity capable of
synthesizing the Galβ1-3GlcNAc structure is a novel
β1,3-galactosyltransferase different from known β3Gal-T1, β3Gal-T2
and β3Gal-T3.
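
The homology thresholds stated above (60%, 80% and 95%) are defined relative to BLAST or FASTA output. As a rough illustration only, the sketch below computes a simple percent identity over two sequences that have already been aligned (gaps written as "-") and maps the result onto the three tiers. This is not the BLAST or FASTA scoring procedure; the function names and tier labels are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of alignment columns with identical residues (gap = '-')."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == "-" and y == "-":
            continue  # a column that is a gap in both rows carries no information
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * matches / compared


def homology_tier(pct: float) -> str:
    """Map a percentage onto the tiers named in the text."""
    if pct >= 95.0:
        return "95% or more"
    if pct >= 80.0:
        return "80% or more"
    if pct >= 60.0:
        return "60% or more"
    return "below 60%"


if __name__ == "__main__":
    pct = percent_identity("ACDE", "ACDF")  # placeholder sequences
    print(pct, homology_tier(pct))
```

A column where only one row has a gap is counted as a mismatch here; real alignment tools treat gaps according to their own scoring parameters, so values from this sketch are not interchangeable with BLAST or FASTA results.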

6. A recombinant DNA prepared by integrating the DNA
described in item 5 into a vector.

7. A recombinant DNA according to item 6 which is plasmid
pAMo-3GT5 or plasmid pBS-3GT5 (FERM BP-6645).

8. A transformant harboring the DNA described in item
5, the recombinant DNA in item 6 or the recombinant DNA in item
7.

9. A transformant according to item 8 which is a member 
selected from the group consisting of a microorganism, an 
animal cell, a plant cell, an insect cell, a non-human 
transgenic animal and a transgenic plant. 

10. A transformant according to item 9 wherein the
microorganism is a microorganism belonging to the genus
Escherichia.

11. A transformant according to item 9 wherein the animal
cell is a member selected from the group consisting of a mouse
myeloma cell, a rat myeloma cell, a mouse hybridoma cell, a
CHO cell, a BHK cell, an African green monkey kidney cell, a
Namalwa cell, a Namalwa KJM-1 cell, a human embryonic kidney
cell and a human leukemia cell.

12. A transformant according to item 9 wherein the insect
cell is a member selected from the group consisting of a
Spodoptera frugiperda ovarian cell, a Trichoplusia ni ovarian
cell and a silkworm ovarian cell.

13. A process for producing the polypeptide described
in any one of items 1 to 4, which comprises culturing a
transformant harboring a recombinant DNA prepared by
integrating DNA coding for the polypeptide of any one of items
1 to 4 into a vector in a medium to thereby form and accumulate
said polypeptide in the culture, and collecting said
polypeptide from said culture.

14. A process for producing the polypeptide described
in any one of items 1 to 4, which comprises breeding a non-human
transgenic animal harboring a recombinant DNA prepared by
integrating DNA coding for the polypeptide of any one of items
1 to 4 into a vector to thereby form and accumulate said
polypeptide in said animal, and collecting said polypeptide
from said animal.

15. A process according to item 14 wherein formation and 
accumulation occur in animal milk. 

16. A process for producing the polypeptide described
in any one of items 1 to 4, which comprises culturing a
transgenic plant harboring a recombinant DNA prepared by
integrating DNA coding for the polypeptide of any one of items
1 to 4 into a vector to thereby form and accumulate said
polypeptide in said plant, and collecting said polypeptide from
said plant.

17. A process for producing the polypeptide described
in any one of items 1 to 4, which comprises synthesizing the
polypeptide of any one of items 1 to 4 in an in vitro
transcription and translation system using DNA coding for said
polypeptide.

18. A process for producing a reaction product having 
galactose, which comprises using the polypeptide of any one 
of items 1 to 4 as an enzyme source, and allowing 

(a) said enzyme source, 

(b) an acceptor substrate selected from the group 
consisting of: 

i) N-acetylglucosamine (GlcNAc) , 

ii) an oligosaccharide having N-acetylglucosamine 
residue at the non-reducing terminus thereof, and 

iii) a complex carbohydrate having N-acetylglucosamine 
residue at the non-reducing terminus thereof, and 

(c) uridine-5'-diphosphate galactose to be present in
an aqueous medium to thereby form and accumulate said reaction
product in the aqueous medium, and collecting said reaction
product from said aqueous medium, wherein the galactose is
transferred via β1,3-linkage to N-acetylglucosamine or an
N-acetylglucosamine residue of said acceptor substrate.

19. A process for producing a reaction product having 
galactose, which comprises using the polypeptide of any of 
items 1 to 4 as an enzyme source, and allowing 

(a) said enzyme source, 

(b) an acceptor substrate selected from the group 
consisting of: 

i) glucose, 

ii) an oligosaccharide having glucose residue at the 
non-reducing terminus thereof, and 

iii) a complex carbohydrate having glucose residue at 
the non-reducing terminus thereof, and 

(c) uridine-5'-diphosphate galactose to be present in
an aqueous medium to thereby form and accumulate said reaction
product in the aqueous medium, and collecting said reaction
product from said aqueous medium, wherein the galactose is
transferred via β1,3-linkage to glucose or a glucose residue of
said acceptor substrate.

20. A process for producing a sugar chain or a complex
carbohydrate, which comprises culturing the transformant
selected from the group consisting of transformants of item
9 derived from a microorganism, an animal cell, a plant cell
and an insect cell in a medium to thereby form and accumulate
a sugar chain having galactose transferred via β1,3-linkage
to N-acetylglucosamine, N-acetylglucosamine residue, glucose
or glucose residue thereof or a complex carbohydrate containing
said sugar chain in the culture, and collecting said sugar chain
or said complex carbohydrate from said culture.

21. A process for producing a sugar chain or a complex 
carbohydrate, which comprises breeding the non-human 
transgenic animal of item 9 to thereby form and accumulate in 
said animal a sugar chain having galactose transferred via 
β1,3-linkage to N-acetylglucosamine, N-acetylglucosamine
residue, glucose or glucose residue thereof or a complex 
carbohydrate containing said sugar chain, and collecting said 
sugar chain or said complex carbohydrate from said animal. 

22. A process for producing a sugar chain or a complex 
carbohydrate, which comprises culturing the transgenic plant 
of item 9 to thereby form and accumulate in said plant a sugar 
chain having galactose transferred via β1,3-linkage to
N-acetylglucosamine, N-acetylglucosamine residue, glucose or
glucose residue thereof or a complex carbohydrate containing 
said sugar chain, and collecting said sugar chain or said 
complex carbohydrate from said plant. 

23. A process according to any of items 18 to 22 wherein 
the complex carbohydrate is a complex carbohydrate selected 
from the group consisting of a glycoprotein, a glycolipid, a 
proteoglycan, a glycopeptide, a lipopolysaccharide, a 
peptidoglycan and a glycoside which is a steroid compound with 
a sugar chain. 

24. A process according to item 21 wherein formation and
accumulation occur in animal milk. 

25. A method for determining the expression level of a
gene encoding the polypeptide of any one of items 1 to 4, which
comprises hybridization using DNA coding for said polypeptide.

26. A DNA selected from the group consisting of an
oligonucleotide having the same nucleotide sequence as a
consecutive 5- to 60-nucleotide sequence in the nucleotide
sequence of the DNA of item 5 or of a DNA having the nucleotide
sequence represented by SEQ ID NO: 2 or 3, an oligonucleotide
having a sequence complementary to said oligonucleotide, and
an oligonucleotide derivative of any of said oligonucleotides.

27. An oligonucleotide according to item 26 wherein the
oligonucleotide derivative is selected from the group
consisting of an oligonucleotide derivative in which the
phosphodiester bond is converted into a phosphorothioate bond,
an oligonucleotide derivative in which the phosphodiester bond
is converted into an N3'-P5' phosphoramidate bond, an
oligonucleotide derivative in which the ribose and the
phosphodiester bond are converted into a peptide-nucleic acid
bond, an oligonucleotide derivative in which the uracil is
replaced by a C-5 propynyluracil, an oligonucleotide
derivative in which the uracil is replaced by a C-5
thiazolyluracil, an oligonucleotide derivative in which the
cytosine is replaced by a C-5 propynylcytosine, an
oligonucleotide derivative in which the cytosine is replaced
by a phenoxazine-modified cytosine, an oligonucleotide
derivative in which the ribose is replaced by a
2'-O-propylribose, and an oligonucleotide derivative in which the
ribose is replaced by a 2'-methoxyethoxyribose.

28. A DNA that has a nucleotide sequence represented by 
SEQ ID NO: 20 or 21. 

29. A method for determining the expression level of a 
gene encoding the polypeptide of any one of items 1 to 4, which 
comprises polymerase chain reaction using the oligonucleotide 
of any one of items 26 to 28. 

30. A method for detecting cancers and cancer metastasis,
which comprises using the method of item 25 or 29.

31. A method for inhibiting transcription of DNA coding
for the polypeptide of any one of items 1 to 4 or translation
of its corresponding mRNA, which comprises using a DNA selected
from DNAs of items 5 and 26 to 28 and DNAs having a nucleotide
sequence represented by SEQ ID NO: 2 or 3.

32. An antibody recognizing the polypeptide of any one 
of items 1 to 4.

33. A method for immunological detection of the 
polypeptide of any one of items 1 to 4, which comprises using 
the antibody of item 32. 

34. An immunohistostaining method, which comprises 
detecting the polypeptide of any one of items 1 to 4 by using 
the antibody of item 32. 

35. An immunohistostaining agent comprising the 
antibody of item 32. 

36. A diagnostic reagent for cancers or cancer metastasis,
which comprises the antibody of item 32. 

37. A method for screening a compound varying the
activity of the polypeptide of any one of items 1 to 4, which 
comprises contacting said polypeptide with a test sample. 

38. A method for screening a compound varying the 
expression of a gene coding for the polypeptide of any one of 
items 1 to 4, which comprises contacting cells expressing said 
polypeptide with a test sample and determining the content of 
sialyl-Lewis a sugar chain, Lewis a sugar chain, Lewis b sugar 
chain or sialyl-Lewis c sugar chain by use of anti-sialyl- 
Lewis a antibody, anti-Lewis a antibody, anti-Lewis b antibody 
or anti-sialyl-Lewis c antibody. 

39. A method for screening a compound varying the 
expression of a gene coding for the polypeptide of any one of 
items 1 to 4, which comprises contacting cells expressing said 
polypeptide with a test sample and determining the content of 
said polypeptide by use of the antibody of item 32. 

40. A promoter DNA governing transcription of a gene
coding for the polypeptide described in any one of items 1 to
4.

41. A promoter DNA according to item 40, which works in
cells selected from the group consisting of small intestine
cells, large intestine cells, pancreas cells, stomach cells,
colon cancer cells, pancreatic cancer cells and stomach cancer
cells.

42. A promoter DNA according to item 40 or 41, which is 
a human- or mouse-derived promoter DNA. 

43. A promoter DNA according to any one of items 40 to
42, which comprises a 50- to 5000-bp consecutive nucleotide 
DNA sequence in the nucleotide sequence of 1 to 5000 in the 
nucleotide sequence represented by SEQ ID NO: 3. 

44. A method for screening a compound varying the 
efficiency of transcription by the promoter DNA of any one of 
items 40 to 43, which comprises transforming animal cells with 
a plasmid containing said promoter DNA and a reporter gene 
ligated downstream of said promoter DNA, then contacting the 
transformant with a test sample, and determining the content 
of a translation product of said reporter gene. 

45. A screening method according to item 44 wherein the
reporter gene is a gene selected from the group consisting of
a chloramphenicol acetyltransferase gene, a β-glucuronidase
gene, a β-galactosidase gene, a β-lactamase gene, a luciferase
gene, an aequorin gene and a green fluorescent protein gene.

46. A knockout non-human animal wherein a DNA coding for
the polypeptide of any one of items 1 to 4 is rendered defective 
or mutated. 

47. A knockout non-human animal according to item 46 
wherein the knockout non-human animal is a mouse. 

Hereinafter, the present invention is described in
detail.

(1) Acquisition of DNA coding for the novel polypeptide of the
present invention having β1,3-galactosyltransferase activity
capable of synthesizing type 1 sugar chains such as
sialyl-Lewis a sugar chain (hereinafter also called novel
β1,3-galactosyltransferase gene) and production of said DNA
and oligonucleotides

A cDNA library is prepared in a usual manner from
sialyl-Lewis a sugar chain- or sialyl-Lewis c sugar
chain-expressing cancer cells in the digestive system, such as colon
cancer cells or pancreatic cancer cells.

The method of preparing the cDNA library includes methods 
described in Molecular Cloning, 2nd edition, Current Protocols 
in Molecular Biology, Supplements 1-38, A Laboratory Manual, 
2nd Ed. (1989), DNA Cloning 1: Core Techniques, A Practical 
Approach, Second Edition, Oxford University Press (1995) or 
methods using commercially available kits such as SuperScript
Plasmid System for cDNA Synthesis and Plasmid Cloning (a
product of GIBCO BRL) and ZAP-cDNA Synthesis Kit (a product
of Stratagene).

Examples of the sialyl-Lewis a sugar chain- or 
sialyl-Lewis c sugar chain-expressing cancer cells in the 
digestive system, such as colon cancer cells or pancreatic 
cancer cells useful in the present invention include human 
colon cancer cell lines Colo205, Colo201 and SW1116 expressing 
sialyl-Lewis a sugar chains and human pancreatic cancer cell 
line Capan-2 expressing sialyl-Lewis a sugar chains.

The cloning vector for preparing the cDNA library may 
be any phage vectors, plasmid vectors, etc. insofar as they 
can be autonomously replicated in E. coli K12. 

Specific examples include ZAP Express [Strategies, 5,
58 (1992), a product of Stratagene], pBluescript II SK(+)
[Nucleic Acids Research, 17, 9494 (1989)], λZAP II (a product
of Stratagene), λgt10, λgt11 [DNA Cloning, A Practical Approach,
1, 49 (1985)], λTriplEx (a product of Clontech), λExCell (a
product of Pharmacia), pT7T318U (a product of Pharmacia), pcD2
[Mol. Cell. Biol., 3, 280 (1983)], pUC18 [Gene, 33, 103 (1985)]
and pAMo [J. Biol. Chem., 268, 22782-22787 (1993), also called
pAMoPRC3Sc (Japanese Published Unexamined Patent Application
No. 336963/93)].

The host microorganisms may be any microorganisms
belonging to Escherichia coli. Specifically, Escherichia
coli XL1-Blue MRF' [Strategies, 5, 81 (1992), a product of
Stratagene], Escherichia coli C600 [Genetics, 39, 440 (1954)],
Escherichia coli Y1088 [Science, 222, 778 (1983)], Escherichia
coli Y1090 [Science, 222, 778 (1983)], Escherichia coli NM522
[J. Mol. Biol., 166, 1 (1983)], Escherichia coli K802 [J. Mol.
Biol., 16, 118 (1966)], Escherichia coli JM105 [Gene, 38, 275
(1985)], Escherichia coli SOLR™ Strain [commercially
available from Stratagene], E. coli LE392 (Molecular Cloning,
2nd ed.), etc. can be used.

The cDNA library includes a cDNA library prepared, e.g.,
in the following manner.

A cDNA is synthesized with a cDNA synthesis system (a 
product of GIBCO BRL) using mRNA derived from human colon cancer 
cell line Colo205. 

A plasmid is prepared by adding an SfiI linker to both
termini of the DNA and then inserting the resulting DNA into
SfiI sites of cloning vector pAMo.

The plasmid is used to transform E. coli LE392 to prepare 
a cDNA library. 

From the prepared cDNA library, a clone containing the 
objective DNA is selected in the following manner. 

From the cDNA library prepared above, plasmids are 
prepared in a usual manner or by using a plasmid maxi kit 
(Product No. 41031), that is, a plasmid preparation kit 
produced by Qiagen. 

By comparing the amino acid sequences of four known
β1,3-galactosyltransferases, two or more well-conserved
regions among the four β1,3-galactosyltransferases are found.

Degenerate primers each having a DNA sequence
corresponding to the amino acid sequence of each region are
designed according to known methods [Carl W. Dieffenbach,
Gabriela S. Dveksler, "PCR Primer: A Laboratory Manual", Cold
Spring Harbor Lab. (1995), The Protocol Series "cDNA Cloning"
ed. Jyunichiro Inoue & Kentaro Senba, Yodosha (1996), Science,
241, 42 (1988)], and polymerase chain reaction (hereinafter
abbreviated to PCR) [Molecular Cloning, 2nd edition and PCR
Protocols, Academic Press (1990)] is conducted using the cDNA
library prepared above as a template, and the amplified
fragment is subcloned into a suitable plasmid.
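
When degenerate primers are designed from a conserved amino acid region, one practical consideration is how many distinct nucleotide sequences could encode a given stretch of residues. The sketch below is illustrative only: it counts codons per residue from the standard genetic code and scans a region for the least degenerate window. The peptide used is a made-up placeholder, not a sequence from this document, and the function names are hypothetical.

```python
from math import prod

# Number of codons for each amino acid in the standard genetic code.
CODONS_PER_RESIDUE = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4,
    "H": 2, "I": 3, "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6,
    "T": 4, "W": 1, "Y": 2, "V": 4,
}


def degeneracy(peptide: str) -> int:
    """Number of distinct coding sequences that can encode the peptide."""
    return prod(CODONS_PER_RESIDUE[res] for res in peptide.upper())


def least_degenerate_window(region: str, width: int):
    """Return (start index, peptide, degeneracy) of the least degenerate window."""
    if width < 1 or width > len(region):
        raise ValueError("width must be between 1 and the region length")
    windows = [
        (degeneracy(region[i:i + width]), i, region[i:i + width].upper())
        for i in range(len(region) - width + 1)
    ]
    fold, start, peptide = min(windows)  # ties resolve to the earliest window
    return start, peptide, fold


if __name__ == "__main__":
    # Placeholder conserved region for illustration only.
    print(least_degenerate_window("LSRMWDK", 4))
```

Windows rich in methionine and tryptophan (one codon each) give the lowest degeneracy, while leucine, serine and arginine (six codons each) raise it sharply, which is why such residues are usually avoided near the 3' end of a degenerate primer.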

Subcloning of the PCR-amplified fragment can be
performed in a usual manner by integrating the amplified
fragment, as such or after treatment with restriction enzymes
or DNA polymerase, into a vector.

The vector includes pBluescript II SK(+) and pBluescript
SK(-) (both available from Stratagene), pDIRECT [Nucleic Acids
Research, 18, 6069 (1990)], pCR-Script Amp SK(+) [a product
of Stratagene, Strategies, 5, 6264 (1992)], pT7Blue [a product
of Novagen], pCR II [a product of Invitrogen, Biotechnology,
9, 657 (1991)], pCR-TRAP [a product of Genehunter], pNoTA T7 (a
product of 5' → 3' Ltd.), etc.

By determining the nucleotide sequence of the subcloned
PCR-amplified fragment, a DNA fragment coding for an amino acid
sequence that is homologous to, but not identical
with, the amino acid sequence of known
β1,3-galactosyltransferase is selected. The nucleotide sequence
of the DNA can be determined by conventional nucleotide
sequence analysis using the dideoxy method of Sanger, et al.
[Proc. Natl. Acad. Sci. USA, 74, 5463 (1977)] or a nucleotide
sequencer such as 373A-DNA sequencer (a product of Perkin
Elmer).

By subjecting the cDNA library prepared above to colony
hybridization or plaque hybridization (Molecular Cloning, 2nd
ed.) with said DNA fragment as the probe, cDNA coding for a
polypeptide having homology to known
β1,3-galactosyltransferases can be obtained. As the probe, said
DNA fragment labeled with an isotope or digoxigenin can be used.

The DNA obtained in the method described above, as such
or after cleavage with appropriate restriction enzymes, is
inserted into a vector by a conventional method, and then the
nucleotide sequence of the DNA can be determined by
conventional nucleotide sequence analysis using the dideoxy
method of Sanger, et al. [Proc. Natl. Acad. Sci. USA, 74, 5463
(1977)] or a nucleotide sequencer such as 373A-DNA sequencer
(a product of Perkin Elmer).

The DNA obtained by said method includes, e.g., DNA 
coding for the polypeptide represented by SEQ ID NO: 1, and 
specifically, DNA having the nucleotide sequence represented 
by SEQ ID NO: 2 or 3 can be mentioned. 

The plasmid containing the DNA represented by SEQ ID NO: 
2 includes, e.g., pAMo-3GT5 and pBS-3GT5 (FERM BP-6645) 
described in the Examples below. 

The DNA obtained in the manner described above is 
integrated into an expression vector to construct an expression 
plasmid. After the resulting expression plasmid is introduced 
into suitable animal cells, whether said DNA codes for 
β1,3-galactosyltransferase participating in the synthesis of
sialyl-Lewis a sugar chain or sialyl-Lewis c sugar chain can 
be examined by analysis with a fluorescence activated cell 
sorter (hereinafter abbreviated to FACS) using anti- 
sialyl-Lewis a sugar chain antibody or anti-sialyl-Lewis c 
sugar chain antibody. 

As said expression vector, a vector capable of
integrating said cDNA therein and expressing it in animal cells
can be used, and for example, pcDNAI/Amp, pcDNAI, pCDM8 (all
of which are available from Funakoshi K. K.), pAGE107 [Japanese
Published Unexamined Patent Application No. 22979/91,
Cytotechnology, 2, 133 (1990)], pREP4 (a product of Invitrogen),
pAGE103 [J. Biochem., 101, 1307 (1987)], pAMo, pAMoA [J. Biol.
Chem., 268, 22782-22787 (1993), also called pAMoPRSA (Japanese
Published Unexamined Patent Application No. 336963/93)],
pAS3-3 (Japanese Published Unexamined Patent Application No.
227075/90), etc. can be mentioned.

The expression vector into which the cDNA has been
integrated is introduced into animal cells suitable for selecting
the objective cDNA, to give transformed cells.

The method of introducing said expression vector may be
any method of introducing DNA into animal cells, and mention
can be made of, e.g., the electroporation method
[Cytotechnology, 3, 133 (1990)], the calcium phosphate method
(Japanese Published Unexamined Patent Application No.
227075/90), the lipofection method [Proc. Natl. Acad. Sci. USA,
84, 7413 (1987)] and the method described in Virology, 52, 456
(1973).

The animal cells include Namalwa cells that are human 
cells, Namalwa KJM-1 cells that are a sub-line of Namalwa cells, 
COS cells that are simian cells, CHO cells that are Chinese 
hamster cells, HBT5637 (Japanese Published Unexamined Patent 
Application No. 299/88) and HCT-15 that is a colon cancer cell 
line, among which Namalwa cells, Namalwa KJM-1 cells and HCT-15 
are preferable. 

The resulting transformed cells are cultured in a usual
manner.

Specifically, mention can be made of the following method
of culturing transformants.

When the transformants are animal cells, the medium for
culturing the cells is a generally used medium such as RPMI1640
medium [The Journal of the American Medical Association, 199,
519 (1967)], Eagle's MEM medium [Science, 122, 501 (1952)],
DMEM medium [Virology, 8, 396 (1959)], 199 medium [Proceeding
of the Society for the Biological Medicine, 73, 1 (1950)] or
any of these media further supplemented with fetal calf serum.

The culturing is conducted usually for 1 to 7 days at
pH 6 to 8, at 25 to 40°C in the presence of 5% CO2.

If necessary, antibiotics such as kanamycin, penicillin 
and streptomycin may be added to the medium during the 
culturing. 

The cells obtained in the culturing are fluorescently
stained with anti-sialyl-Lewis a sugar chain antibody or
anti-sialyl-Lewis c sugar chain antibody and then analyzed by
FACS, to examine whether the amount of sialyl-Lewis a sugar
chains or sialyl-Lewis c sugar chains in the cells transformed
with said expression plasmid is increased or not. If the
amount of sialyl-Lewis a sugar chains or sialyl-Lewis c sugar
chains is increased, said DNA can be estimated to code for novel
β1,3-galactosyltransferase involved in the synthesis of
sialyl-Lewis a sugar chain or sialyl-Lewis c sugar chain.

Any antibodies reacting with sialyl-Lewis a sugar chain
or sialyl-Lewis c sugar chain can be used as anti-sialyl-Lewis
a sugar chain antibody or anti-sialyl-Lewis c sugar chain
antibody, respectively, and examples thereof include 19-9 (a
product of Fujirebio) and KM231 (a product of Kyowa Medex) as
anti-sialyl-Lewis a sugar chain antibodies and DU-PAN-2 (a
product of Kyowa Medex) as anti-sialyl-Lewis c sugar chain
antibody.

In the manner described above, it is possible to obtain
DNA coding for the novel polypeptide having
β1,3-galactosyltransferase activity, participating in the
synthesis of cancer-related sugar chains belonging to type 1
sugar chains such as sialyl-Lewis a sugar chain in cancer cells
in the digestive system, such as colon cancer cells, pancreatic
cancer cells, etc.

By selecting DNA hybridizing under stringent conditions 
with the DNA obtained in the method described above, it is 
possible to obtain the objective DNA coding for a polypeptide 
having an amino acid sequence where in SEQ ID NO: 1, one or
more amino acids have been replaced, deleted or added. 

That is, the objective DNA can be obtained by screening
a cDNA library derived from non-human animals such as mouse,
rat, cattle and monkey.

On the basis of the determined amino acid sequence of
the novel β1,3-galactosyltransferase polypeptide, the
objective DNA can also be prepared by chemically synthesizing
DNA coding for said polypeptide. Chemical synthesis of the
DNA can be carried out with a DNA synthesizer using the
thiophosphite method (a product of Shimadzu Corporation), a
DNA synthesizer model 392 using the phosphoramidite method (a
product of Perkin Elmer), or the like.

Further, the objective DNA can also be prepared by PCR
where oligonucleotides described below are used as sense primer
and antisense primer while cDNA prepared from mRNA in cells
expressing mRNA complementary to said DNA is used as a template.

The DNA and DNA fragment of the present invention
obtained in the above-described methods can be used in the
conventional method described in Molecular Cloning, 2nd
edition or in a DNA synthesizer, to prepare oligonucleotides
such as antisense oligonucleotide, sense oligonucleotide, etc.
having a partial sequence of the DNA of the present invention.

Said oligonucleotides include DNA having the same 
sequence as a sequence of consecutive 5 to 60 nucleotides in 
the objective DNA, or DNA having a complementary sequence to 
said DNA, and specific examples include DNA having the same 
sequence as a consecutive 5- to 60-nucleotide sequence in the 
nucleotide sequence represented by SEQ ID NO: 2 or 3, or DNA 
having a complementary sequence to said DNA. If these are used 
as sense and antisense primers, two oligonucleotides described 
above having similar melting temperatures (Tm) and similar 
numbers of nucleotides are preferably used. Specifically, 
mention is made of oligonucleotides having the nucleotide 
sequences represented by SEQ ID NOS: 20, 21, etc.
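The preference for a primer pair with similar melting
temperatures and similar lengths can be checked with a short
script. The following is only a minimal sketch: it assumes the
Wallace rule, Tm = 2(A+T) + 4(G+C), which is a rough
approximation for short oligonucleotides, and the tolerance
values and primer sequences are hypothetical placeholders, not
the sequences of SEQ ID NOS: 20 and 21.

```python
def wallace_tm(primer: str) -> int:
    """Approximate Tm (deg C) by the Wallace rule: 2*(A+T) + 4*(G+C)."""
    seq = primer.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    if at + gc != len(seq):
        raise ValueError("primer must contain only A, C, G and T")
    return 2 * at + 4 * gc


def is_matched_pair(sense: str, antisense: str,
                    max_tm_diff: int = 4, max_len_diff: int = 3) -> bool:
    """True when both primers are 5 to 60 nucleotides long and are
    similar in Tm and in length (the tolerances are assumed values)."""
    for primer in (sense, antisense):
        if len(primer) not in range(5, 61):
            return False
    tm_diff = abs(wallace_tm(sense) - wallace_tm(antisense))
    len_diff = abs(len(sense) - len(antisense))
    return max_tm_diff >= tm_diff and max_len_diff >= len_diff


# Hypothetical placeholder primers (not taken from the sequence listing).
SENSE = "ATGCTGACCGGTAAGCTTGA"
ANTISENSE = "TCAGGCATTCGACCTGAATG"
print(wallace_tm(SENSE), wallace_tm(ANTISENSE),
      is_matched_pair(SENSE, ANTISENSE))
```

In practice a nearest-neighbor Tm model would normally replace
the Wallace rule; the structure of the check stays the same.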

Furthermore, derivatives of these oligonucleotides 
(also referred to hereinafter as oligonucleotide derivatives) 
can also be used as the oligonucleotide of the present 
invention. 

Said oligonucleotide derivatives include an 
oligonucleotide derivative in which the phosphodiester bond 
is converted into a phosphorothioate bond, an oligonucleotide 
derivative in which the phosphodiester bond is converted into 
an N3'-P5' phosphoramidate bond, an oligonucleotide derivative
in which the ribose and the phosphodiester bond are converted
into a peptide-nucleic acid bond, an oligonucleotide
derivative in which the uracil is replaced by a C-5
propynyluracil, an oligonucleotide derivative in which the
uracil is replaced by a C-5 thiazolyluracil, an oligonucleotide
derivative in which the cytosine is replaced by a C-5
propynylcytosine, an oligonucleotide derivative in which the
cytosine is replaced by a phenoxazine-modified cytosine, an
oligonucleotide derivative in which the ribose is replaced by
a 2'-O-propylribose, and an oligonucleotide derivative in
which the ribose is replaced by a 2'-methoxyethoxyribose [Saibo
Kogaku (Cell Engineering), 16, 1463 (1997)].

(2) Production of novel β1,3-galactosyltransferase
polypeptide (also referred to hereinafter as the polypeptide
of the present invention)

The polypeptide of the present invention can be produced 
by expressing the DNA of the present invention in host cells 
in, e.g., the following manner using the methods described in
Molecular Cloning, 2nd edition or Current Protocols in
Molecular Biology Supplements 1-38, etc.

On the basis of the full-length DNA coding for the
polypeptide of the present invention, a DNA fragment of
suitable length containing a region coding for said polypeptide
is prepared, if necessary.

Further, DNA useful for improving the production
efficiency of said polypeptide can be prepared by replacing
nucleotides in the nucleotide sequence of the region encoding
the polypeptide so that each codon is the one most suitable for
expression in the host.
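The codon replacement described above can be sketched as a
synonymous substitution that leaves the encoded amino acid
sequence unchanged. This is a minimal illustration only: the
preferred-codon map and the input sequence are hypothetical
placeholders, not measured codon-usage data for any particular
host and not part of SEQ ID NO: 2 or 3.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa
               for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Hypothetical host preference: one preferred codon per amino acid.
PREFERRED = {"L": "CTG", "R": "CGT", "K": "AAA", "G": "GGT"}


def translate(dna: str) -> str:
    """Translate a coding sequence whose length is a multiple of 3."""
    dna = dna.upper()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def optimize(dna: str, preferred: dict) -> str:
    """Replace each codon by the preferred synonymous codon, if any."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    out = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        amino_acid = CODON_TABLE[codon]
        new_codon = preferred.get(amino_acid, codon)
        if CODON_TABLE[new_codon] != amino_acid:
            raise ValueError("preferred codon is not synonymous")
        out.append(new_codon)
    return "".join(out)


ORIGINAL = "ATGTTAAGAAAGGGGTAA"      # placeholder: Met-Leu-Arg-Lys-Gly-stop
OPTIMIZED = optimize(ORIGINAL, PREFERRED)
print(OPTIMIZED, translate(OPTIMIZED) == translate(ORIGINAL))
```

Comparing the translations before and after the substitution
confirms that only the nucleotide sequence, and not the encoded
polypeptide, has been changed.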

Said DNA fragment or said full-length DNA is inserted 
into a site downstream from a promoter in a suitable expression 
vector to construct a recombinant DNA (recombinant vector).

By introducing said recombinant vector into host cells 
suitable for said expression vector, a transformant expressing
the polypeptide of the present invention can be obtained. 

The host cells may be prokaryotic cells, yeasts, animal
cells, insect cells or plant cells insofar as they can express
the objective gene. Animals or plants can also be used.

The expression vector used is a vector capable of
autonomous replication or integration into the chromosome in
the host cells and containing a promoter at a position suitable
for transcription of the novel β1,3-galactosyltransferase
gene.

When prokaryotes such as bacteria are used as the host
cells, it is preferred that the expression vector for the novel
β1,3-galactosyltransferase gene is capable of autonomous
replication in the prokaryotes and comprises a promoter, a
ribosome-binding sequence, the novel
β1,3-galactosyltransferase gene, and a transcription
termination sequence. The vector may further comprise a gene
regulating the promoter.

The expression vector includes, e.g., pBTrp2, pBTac1,
pBTac2 (which all are commercially available from Boehringer
Mannheim), pKK233-2 (a product of Pharmacia), pSE280 (a product
of Invitrogen), pGEMEX-1 (a product of Promega), pQE-8 (a
product of QIAGEN), pKYP10 (Japanese Published Unexamined
Patent Application No. 110600/83), pKYP200 [Agric. Biol. Chem.,
48, 669 (1984)], pLSA1 [Agric. Biol. Chem., 53, 277 (1989)],
pGEL1 [Proc. Natl. Acad. Sci., USA, 82, 4306 (1985)],
pBluescript II SK+ (a product of Stratagene), pBluescript II
SK(-) (a product of Stratagene), pTrs30 (FERM BP-5407), pTrs32
(FERM BP-5408), pGHA2 (FERM BP-400), pGKA2 (FERM B-6798),
pTerm2 (Japanese Published Unexamined Patent Application No.
22979/91, US 4686191, US 4939094, US 5160735), pEG400 [J.
Bacteriol., 112, 2392 (1990)], pGEX (a product of Pharmacia),
pET system (a product of Novagen), pSupex, pUB110, pTP5, pC194,
pTrxFus (a product of Invitrogen), pMAL-c2 (a product of New
England Biolabs), pUC19 [Gene, 33, 103 (1985)], pSTV28 (a
product of Takara Shuzo Co., Ltd.), pUC118 (a product of Takara
Shuzo Co., Ltd.) and pPA1 (Japanese Published Unexamined Patent
Application No. 233798/88).

The promoter may be any one insofar as it is capable of
working in host cells such as E. coli, etc. Examples are
promoters derived from E. coli, phage, etc., such as trp
promoter (Ptrp), lac promoter (Plac), PL promoter, PR promoter,
PSE promoter, etc. as well as SPO1 promoter, SPO2 promoter, penP
promoter, etc. Artificially designed and modified promoters
such as a Ptrp x2 promoter in which two Ptrps are combined in
tandem, tac promoter, lacT7 promoter, let I promoter, etc. can
also be used.

A plasmid in which the distance between the Shine-Dalgarno
sequence (ribosome binding sequence) and the initiation codon
is adjusted to an appropriate length (e.g., 6 to 18 bases) is
preferably used.
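The spacing condition can be checked on a candidate upstream
sequence as sketched below. This is an illustration under
stated assumptions: the Shine-Dalgarno motif is taken to be the
consensus AGGAGG, the first ATG found downstream of it is taken
to be the initiation codon, and the example sequence is a
hypothetical placeholder rather than a sequence from any of the
vectors listed above.

```python
from typing import Optional


def sd_spacing(upstream: str, sd: str = "AGGAGG",
               start: str = "ATG") -> Optional[int]:
    """Number of bases between the end of the SD motif and the first
    downstream start codon, or None when either element is missing."""
    seq = upstream.upper()
    sd_pos = seq.find(sd)
    if sd_pos == -1:
        return None
    sd_end = sd_pos + len(sd)
    start_pos = seq.find(start, sd_end)
    if start_pos == -1:
        return None
    return start_pos - sd_end


def spacing_ok(upstream: str, low: int = 6, high: int = 18) -> bool:
    """True when the SD-to-start spacing lies in the range low..high."""
    n = sd_spacing(upstream)
    return n is not None and n in range(low, high + 1)


CANDIDATE = "TTGACAAGGAGGTTAACATATGAAA"   # hypothetical placeholder
print(sd_spacing(CANDIDATE), spacing_ok(CANDIDATE))
```

The default range of 6 to 18 bases follows the example given in
the text; both bounds are parameters so that another range can
be tested.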

Although a transcription termination sequence is not 
necessarily required to express the DNA of the present
invention, it is preferable to locate the transcription 
termination sequence just downstream from the structural gene. 

The host cells include microorganisms belonging to the
genera Escherichia, Serratia, Bacillus, Brevibacterium,
Corynebacterium, Microbacterium and Pseudomonas. Specific
examples are Escherichia coli XL1-Blue, Escherichia coli
XL2-Blue, Escherichia coli DH1, Escherichia coli MC1000,
Escherichia coli KY3276, Escherichia coli W1485, Escherichia
coli JM109, Escherichia coli HB101, Escherichia coli No. 49,
Escherichia coli W3110, Escherichia coli NY49, Escherichia
coli BL21(DE3), Escherichia coli BL21(DE3)pLysS, Escherichia
coli HMS174(DE3), Escherichia coli HMS174(DE3)pLysS, Serratia
ficaria, Serratia fonticola, Serratia liquefaciens, Serratia
marcescens, Bacillus subtilis, Bacillus amyloliquefaciens,
Brevibacterium ammoniagenes, Brevibacterium immariophilum
ATCC14068, Brevibacterium saccharolyticum ATCC14066,
Corynebacterium glutamicum ATCC13032, Corynebacterium
glutamicum ATCC14067, Corynebacterium glutamicum ATCC13869,
Corynebacterium acetoacidophilum ATCC13870, Microbacterium
ammoniaphilum ATCC15354, Pseudomonas sp. D-0110, etc.

The method of introducing the recombinant vector may be
any method of introducing DNA into the host cells described
above, and for example, mention can be made of the
electroporation method [Nucleic Acids Res., 16, 6127 (1988)],
the method using calcium ion [Proc. Natl. Acad. Sci. USA, 69,
2110 (1972)], the protoplast method (Japanese Published
Unexamined Patent Application No. 248394/88), and the methods
described in Gene, 17, 107 (1982) and Molecular & General
Genetics, 168, 111 (1979).

When yeasts are used as the host cells, expression
vectors such as YEp13 (ATCC37115), YEp24 (ATCC37051), YCp50
(ATCC37419), pHS19, pHS15, etc. can be exemplified.

Any promoter can be used insofar as it is capable
of working in yeasts. For example, mention can be made of
promoters such as PHO5 promoter, PGK promoter, GAP promoter,
ADH promoter, gal 1 promoter, gal 10 promoter, heat shock
protein promoter, MF α1 promoter, CUP 1 promoter, etc.

The host cells include yeast strains belonging to the
genera Saccharomyces, Schizosaccharomyces, Kluyveromyces,
Trichosporon, Schwanniomyces and Pichia. Specifically,
mention can be made of Saccharomyces cerevisiae,
Schizosaccharomyces pombe, Kluyveromyces lactis,
Trichosporon pullulans, Schwanniomyces alluvius, Pichia
pastoris, etc.

The method of introducing the recombinant vector may be
any method of introducing DNA into yeast, and examples include
the electroporation method [Methods Enzymol., 194, 182 (1990)],
the spheroplast method [Proc. Natl. Acad. Sci. USA, 84, 1929
(1978)], the lithium acetate method [J. Bacteriol., 153, 163
(1983)] and the method described in Proc. Natl. Acad. Sci. USA,
75, 1929 (1978).

When animal cells are used as the host cells, mention
can be made of, e.g., expression vectors such as pcDNAI/Amp,
pcDNAI, pCDM8 (which all are available from Funakoshi), pAGE107
[Japanese Published Unexamined Patent Application No.
22979/91, Cytotechnology, 3, 133 (1990)], pREP4 (a product of
Invitrogen), pAGE103 [J. Biochem., 101, 1307 (1987)], pAMo,
pAMoA [J. Biol. Chem., 268, 22782-22787 (1993), also called
pAMoPRSA (Japanese Published Unexamined Patent Application
No. 336963/93)], and pAS3-3 (Japanese Published Unexamined
Patent Application No. 227075/90).

The promoter used may be any promoter capable of working
in animal cells. Examples are the promoter of IE (immediate
early) gene of cytomegalovirus (human CMV), SV40 early promoter,
the long terminal repeat promoter of Moloney murine leukemia
virus, the promoter of retrovirus, a heat shock promoter, SRα
promoter and a metallothionein promoter. Furthermore, the
enhancer of IE gene of human CMV may be used together with the
promoter.

The host cells include mouse myeloma cells, rat myeloma
cells, mouse hybridoma cells, CHO cells that are Chinese
hamster cells, BHK cells, African green monkey kidney cells,
Namalwa cells or Namalwa KJM-1 cells that are human cells, human
embryonic kidney cells, human leukemia cells, HBT5637
(Japanese Published Unexamined Patent Application No. 299/88)
and human colon cancer cell lines.

The mouse myeloma cells include SP2/0 and NS0; the rat
myeloma cells include YB2/0; the human embryonic kidney cells
include HEK293 (ATCC: CRL-1573); the human leukemia cells
include BALL-1; the African green monkey kidney cells include
COS-1 and COS-7; and the human colon cancer cell lines include
HCT-15.

The method of introducing the recombinant vector may be
any method of introducing DNA into animal cells. For example,
mention can be made of the electroporation method
[Cytotechnology, 3, 133 (1990)], the calcium phosphate method
(Japanese Published Unexamined Patent Application No.
227075/90), the lipofection method [Proc. Natl. Acad. Sci. USA,
84, 7413 (1987)], and the method described in Virology, 52,
456 (1973).

When insect cells are used as the host, the polypeptide
can be expressed by methods described in Baculovirus Expression
Vectors, A Laboratory Manual, W. H. Freeman and Company, New
York (1992); Molecular Biology, A Laboratory Manual; Current
Protocols in Molecular Biology Supplements 1-38;
Bio/Technology, 6, 47 (1988), and so on.

That is, the recombinant gene transfer vector and a
baculovirus are cotransfected into insect cells so that a
recombinant virus is obtained in the culture supernatant of
the insect cells, and then insect cells are infected with the
recombinant virus, whereby the polypeptide can be expressed.

The gene transfer vector used in this method includes,
e.g., pVL1392, pVL1393 and pBlueBacIII (which all are products
of Invitrogen).

As the baculovirus, it is possible to employ, e.g.,
Autographa californica nuclear polyhedrosis virus, which is
a virus infecting insects of the family Barathra.

As the insect cells, it is possible to use Spodoptera
frugiperda ovarian cells, Trichoplusia ni ovarian cells,
cultured cells derived from silkworm ovaries, etc.

Spodoptera frugiperda ovarian cells include Sf9 and Sf21
(Baculovirus Expression Vectors, A Laboratory Manual);
Trichoplusia ni ovarian cells include High 5 and BTI-TN-5B1-4
(a product of Invitrogen); and the cultured cells derived from
silkworm ovaries include Bombyx mori N4.

As methods for cotransfecting the aforesaid recombinant
gene transfer vector and the aforesaid baculovirus into insect
cells to prepare the recombinant virus, the calcium phosphate
method (Japanese Published Unexamined Patent Application No.
227075/90) and the lipofection method [Proc. Natl. Acad. Sci.
USA, 84, 7413 (1987)] can be exemplified.

The same methods as for introducing DNA into animal cells
can be used for introducing DNA into insect cells. For example,
mention can be made of the electroporation method
[Cytotechnology, 3, 133 (1990)], the calcium phosphate method
(Japanese Published Unexamined Patent Application No.
227075/90), and the lipofection method [Proc. Natl. Acad. Sci.
USA, 84, 7413 (1987)].

When plant cells or plants are used as the host, the
polypeptide can be produced according to known methods [Soshiki
Baiyo (Tissue Culture), 20 (1994); Soshiki Baiyo (Tissue
Culture), 21 (1995); Trends in Biotechnology, 15, 45 (1997)].

The expression vector includes, e.g., Ti plasmid and
tobacco mosaic virus vector.

Any promoter capable of working in plant cells can be
used as the promoter for the gene expression, and examples
include 35S promoter of cauliflower mosaic virus (CaMV), and
rice actin 1 promoter. Furthermore, the efficiency of
expression of the gene can also be raised by inserting, e.g.,
intron 1 of the maize alcohol dehydrogenase gene into a region
between the promoter and the gene to be expressed.

The host cells include cells of plants such as potato, 
tobacco, maize, rice, rape, soybean, tomato, carrot, wheat, 
barley, rye, alfalfa, flax, etc.

The method of introducing the recombinant vector may be
any method of introducing DNA into plant cells. For example,
it is possible to use a method using Agrobacterium (Japanese
Published Unexamined Patent Application No. 140885/84,
Japanese Published Unexamined Patent Application No. 70080/85,
WO94/00977), the electroporation method (Japanese Published
Unexamined Patent Application No. 251887/85) or a method using
a particle gun (gene gun) (Japanese Patents Nos. 2606856 and
2517813).

The plant cells or organs to which the gene is introduced
can be subjected to mass culture using a jar fermentor.

The medium used in culturing includes generally employed
media such as Murashige & Skoog (MS) medium and White medium,
and media prepared by adding phytohormones such as auxin and
cytokinin to these media.

The culturing is conducted usually at pH 5 to 9 at 20
to 40°C for 3 to 60 days.

If necessary, antibiotics such as kanamycin and
hygromycin may be added to the medium during the culturing.

Further, plant cells to which the gene is introduced can
be re-differentiated to produce a plant (transgenic plant)
having the introduced gene.

An animal can also be used to produce the polypeptide
of the present invention. For example, the polypeptide of the
present invention can be produced according to known methods
[American Journal of Clinical Nutrition, 63, 639S (1996),
American Journal of Clinical Nutrition, 63, 627S (1996),
Bio/Technology, 9, 830 (1991)] in an animal to which the gene
was introduced.

The promoter used may be any promoter capable of working
in an animal. For example, mammary gland cell-specific
promoters such as α-casein promoter, β-casein promoter,
β-lactoglobulin promoter, and whey acidic protein promoter are
preferably used.

A transformant derived from a microorganism, animal
cells or plant cells harboring a recombinant vector containing
DNA coding for the polypeptide of the present invention is
cultured by a usual culture method until the polypeptide is
formed and accumulated, and the polypeptide is collected from
the culture, whereby said polypeptide can be produced.

When the transformant is an animal or plant, it is raised
or cultured in a usual manner until the polypeptide is formed
and accumulated, and the polypeptide is recovered from the
animal or plant, whereby said polypeptide can be produced.

That is, in the case of an animal, for example, a
non-human transgenic animal harboring the DNA of the present
invention is raised until the novel polypeptide having
β1,3-galactosyltransferase activity, encoded by said
recombinant DNA, is formed and accumulated in the animal, and
the polypeptide is collected from the animal, whereby the novel
polypeptide having β1,3-galactosyltransferase activity can be
produced. The place where the polypeptide is formed and
accumulated in the animal is, for example, the milk or egg of
the animal.

In the case of the plant, for example, a transgenic plant
harboring the DNA of the present invention is cultured until
the novel polypeptide having β1,3-galactosyltransferase
activity, encoded by said recombinant DNA, is formed and
accumulated in the plant, and the polypeptide is recovered from
the plant, whereby the novel polypeptide having
β1,3-galactosyltransferase activity can be produced.

When the transformants for producing the polypeptide of
the present invention are prokaryotes such as E. coli, etc.
or eucaryotes such as yeast, etc., the polypeptide of the
present invention can be produced by culturing the
transformants of the present invention in a medium to thereby
form and accumulate the polypeptide of the present invention
in the culture, and collecting said polypeptide from said
culture.

The transformant of the present invention can be cultured
in the medium according to conventional methods used for
culturing the host.

The medium for culturing the transformant obtained by
using prokaryotes such as E. coli or eucaryotes such as yeast
may be a natural or synthetic medium insofar as the medium
contains a carbon source, a nitrogen source, inorganic salts,
etc. which can be assimilated by the organisms and in which
the transformants can be efficiently cultured.

Any carbon source can be used insofar as it can be
assimilated by the microorganisms, and the following can be
used: carbohydrates such as glucose, fructose, sucrose,
molasses containing them, starch and starch hydrolyzates;
organic acids such as acetic acid and propionic acid; alcohols
such as ethanol and propanol, and the like.

As the nitrogen source, the following can be used:
ammonia; ammonium salts of various inorganic acids or organic
acids, such as ammonium chloride, ammonium sulfate, ammonium
acetate, and ammonium phosphate; other nitrogen-containing
compounds; peptone; meat extracts; yeast extracts; corn steep
liquor; casein hydrolyzates; soy bean meal; soy bean meal
hydrolyzates; various fermented cells and hydrolyzates
thereof; and the like.

As the inorganic salts, the following can be used:
potassium dihydrogenphosphate, dipotassium hydrogenphosphate,
magnesium phosphate, magnesium sulfate, sodium chloride,
ferrous sulfate, manganese sulfate, copper sulfate, calcium
carbonate, and the like.

The culturing is conducted under aerobic conditions
using, e.g., shaking culture or submerged spinner culture under
aeration.

The culturing temperature is preferably 15 to 40°C, and
the culturing period is usually 5 hours to 7 days.

During the culturing, pH is maintained at 3.0 to 9.0.
Adjustment of the medium pH is conducted using an inorganic
or organic acid, an alkaline solution, urea, calcium carbonate,
ammonia, and the like.

If necessary, antibiotics such as ampicillin and
tetracycline may further be added to the medium during the
culturing.

For culturing a microorganism transformed with an
expression vector using an inducible promoter as a promoter,
an inducer may be added to the medium, if necessary. For
example, for culturing a microorganism transformed with an
expression vector using lac promoter,
isopropyl-β-D-thiogalactopyranoside, or the like, may be added
to the medium; for culturing a microorganism transformed with
an expression vector using trp promoter, indole acrylic acid,
or the like, may be added to the medium.

When the transformants for producing the polypeptide of
the present invention are animal cells, the medium for
culturing the cells is a generally employed medium such as
RPMI1640 medium [The Journal of the American Medical
Association, 199, 519 (1967)], Eagle's MEM medium [Science,
122, 501 (1952)], DMEM medium [Virology, 8, 396 (1959)], 199
medium [Proceeding of the Society for the Biological Medicine,
73, 1 (1950)] or any of these media further supplemented with
fetal calf serum.

The culturing is conducted usually at pH 6 to 8 at 25
to 40°C for 1 to 7 days in the presence of 5% CO2.

If necessary, antibiotics such as kanamycin, penicillin
and streptomycin may be added to the medium during the
culturing.

When the transformants for producing the polypeptide of
the present invention are insect cells, the medium for
culturing said cells may be a generally employed medium such
as TNM-FH medium (a product of Pharmingen), Sf-900 II SFM medium
(a product of GIBCO BRL), ExCell 400 and ExCell 405 [both are
products of JRH Biosciences] and Grace's Insect Medium [Nature,
195, 788 (1962)].

The culturing is conducted usually at pH 6 to 7 at a
temperature of 25 to 30°C for 1 to 5 days.

If necessary, antibiotics such as gentamycin may be added
to the medium during the culturing.

The method of expressing the gene includes a method of
expressing a partial polypeptide containing a region having
β1,3-galactosyltransferase activity besides the full-length
polypeptide. Generally, a glycosyltransferase has the topology
of a type 2 transmembrane protein and consists of a cytoplasmic
region consisting of a few to several tens of N-terminal amino
acids, a membrane-binding region containing highly hydrophobic
amino acids, a stem region consisting of a few to several tens
of amino acids, and a large C-terminal region containing a
catalytic domain. It is estimated that the stem region and the
large C-terminal region containing the catalytic domain are
exposed to the lumen of the Golgi body. The border between the
stem region and the catalytic domain can be experimentally
determined by deleting the N-terminal region stepwise and
examining the point at which the activity of the polypeptide
disappears. The stem region and the catalytic domain can also
be estimated by comparison with the amino acid sequence of an
analogous glycosyltransferase whose stem region and catalytic
domain have been revealed.

The novel β1,3-galactosyltransferase of the present invention
also has a structure similar to those of other
glycosyltransferases.

For example, the polypeptide of the present invention
having the amino acid sequence represented by SEQ ID NO: 1
consists of a cytoplasmic region consisting of the N-terminal
seven amino acids, a highly hydrophobic membrane-binding region
consisting of 19 amino acids, a stem region consisting of at
least 4 amino acids, and a large C-terminal region containing
a catalytic domain. On the basis of comparison of its homology
in amino acid sequence to other β1,3-galactosyltransferases
and information on the stem region and catalytic domain of other
β1,3-galactosyltransferases [Japanese Published Unexamined
Patent Application No. 181759/94], it is estimated that the
stem region consists of at least 4 amino acids. Accordingly,
a partial polypeptide containing the amino acid sequence of
31 to 310 is considered to contain the catalytic domain.
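The residue numbering behind the range 31 to 310 follows from
adding up the region lengths given above (7 + 19 + 4 = 30). The
sketch below makes that arithmetic explicit. It assumes that
residue 310 is the C-terminal end, as the stated range implies,
and it uses a dummy placeholder sequence rather than the actual
sequence of SEQ ID NO: 1.

```python
# Region lengths taken from the description; the stem length is the
# stated minimum ("at least 4 amino acids").
REGIONS = [("cytoplasmic", 7), ("membrane-binding", 19), ("stem", 4)]
TOTAL_LENGTH = 310   # assumption: last residue of the stated range


def region_bounds(regions, total):
    """Return 1-based inclusive (first, last) residue numbers per region."""
    bounds = {}
    first = 1
    for name, length in regions:
        bounds[name] = (first, first + length - 1)
        first += length
    bounds["catalytic"] = (first, total)
    return bounds


def subsequence(protein: str, first: int, last: int) -> str:
    """Extract residues first..last (1-based, inclusive)."""
    return protein[first - 1:last]


BOUNDS = region_bounds(REGIONS, TOTAL_LENGTH)
PLACEHOLDER = "M" + "A" * (TOTAL_LENGTH - 1)   # dummy sequence only
PARTIAL = subsequence(PLACEHOLDER, *BOUNDS["catalytic"])
print(BOUNDS, len(PARTIAL))
```

Because the stem length is a lower bound, the start of the
partial polypeptide computed here is the earliest position
consistent with the description, not a measured domain border.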

The full-length polypeptide described above or the
partial polypeptide containing the region having
β1,3-galactosyltransferase activity (catalytic domain) can be
expressed directly or as a secreted protein or fusion protein
in accordance with the methods described in Molecular Cloning,
2nd edition. The protein to be fused includes β-galactosidase,
protein A, IgG-binding region of protein A, chloramphenicol
acetyltransferase, poly(Arg), poly(Glu), protein G,
maltose-binding protein, glutathione S-transferase,
polyhistidine chain (His-tag), S peptide, DNA-binding protein
domain, Tac antigen, thioredoxin, green fluorescent protein,
and any antibody epitope [Akio Yamakawa: Jikken Igaku
(Experimental Medicine), 13, 469-474 (1995)].

The method of producing the polypeptide of the present
invention includes intracellular production by host cells,
extracellular secretion by host cells or production on outer
membranes by host cells, and the method can be selected
depending on the host cells used or on alteration of the
structure of the polypeptide to be produced.

When the polypeptide of the present invention is produced
in host cells or on outer membranes of host cells, it is possible
to force the polypeptide to be secreted outside the host cells
by use of the method of Paulson, et al. [J. Biol. Chem., 264,
17619 (1989)], the method of Lowe, et al. [Proc. Natl. Acad.
Sci., USA, 86, 8227 (1989), Genes Develop., 4, 1288 (1990)]
or methods described in Japanese Published Unexamined Patent
Application No. 336963/93, WO94/23021, etc.

That is, the polypeptide of the present invention can
be secreted outside the host cells by using gene manipulation
techniques to express it in a form in which a signal peptide
is added upstream of a polypeptide portion containing the
active site of the polypeptide of the present invention.

Specifically, it is possible to force the polypeptide
of the present invention to be secreted outside the host cells
by expressing it in the form in which a signal peptide is added
upstream of a polypeptide portion containing the amino acid
sequence of 31 to 310 considered to contain the catalytic site.
Further, a tag for purification and detection can also be added
between the signal peptide and the polypeptide containing the
catalytic domain or to the C-terminus of the polypeptide
containing the catalytic domain. The tag for purification and
detection includes β-galactosidase, protein A, IgG-binding
region of protein A, chloramphenicol acetyltransferase,
poly(Arg), poly(Glu), protein G, maltose-binding protein,
glutathione S-transferase, polyhistidine chain (His-tag), S
peptide, DNA-binding protein domain, Tac antigen, thioredoxin,
green fluorescent protein, and any antibody epitope [Akio
Yamakawa: Jikken Igaku (Experimental Medicine), 13, 469-474
(1995)].
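The two tag placements described above (between the signal
peptide and the catalytic portion, or at the C-terminus) can be
written out as a simple sequence assembly. This is an
illustrative sketch only: the signal peptide and the catalytic
portion are hypothetical placeholder strings, a six-residue
polyhistidine chain is assumed as the tag, and cleavage of the
signal peptide in the host cell is not modeled.

```python
def build_secreted_construct(signal_peptide: str, catalytic: str,
                             tag: str = "", tag_position: str = "N") -> str:
    """Assemble signal peptide, optional tag and catalytic portion.

    tag_position "N": the tag sits between the signal peptide and the
                      catalytic portion.
    tag_position "C": the tag sits at the C-terminus of the catalytic
                      portion.
    """
    if tag_position == "N":
        return signal_peptide + tag + catalytic
    if tag_position == "C":
        return signal_peptide + catalytic + tag
    raise ValueError("tag_position must be 'N' or 'C'")


SIGNAL = "MKTLLLLAVLAAASA"   # hypothetical signal peptide
CATALYTIC = "CATALYTIC"      # stand-in for residues 31 to 310
HIS_TAG = "HHHHHH"           # assumed six-residue polyhistidine chain

N_TAGGED = build_secreted_construct(SIGNAL, CATALYTIC, HIS_TAG, "N")
C_TAGGED = build_secreted_construct(SIGNAL, CATALYTIC, HIS_TAG, "C")
print(N_TAGGED)
print(C_TAGGED)
```

In an actual construct the same ordering would be applied at
the DNA level, in frame, downstream of the promoter.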

The amount of the polypeptide produced can be increased
by a gene amplification system using a dihydrofolate reductase
gene, or the like, according to the method described in Japanese
Published Unexamined Patent Application No. 227075/90.

For isolation and purification of the polypeptide of the
present invention from a culture of the transformant for
producing the polypeptide of the present invention,
conventional methods for the isolation and purification of
enzymes can be used.

For example, when the polypeptide of the present
invention is accumulated in a soluble form in the transformant
cells for producing the polypeptide of the present invention,
the cells are recovered from the culture by centrifugation,
then washed and disrupted with a sonicator, French Press,
Manton-Gaulin homogenizer, Dynomill, or the like, to obtain
a cell-free extract.

A purified preparation can be obtained by centrifuging
the cell-free extract to obtain the supernatant and then by
subjecting the supernatant to extraction with a solvent,
salting-out with ammonium sulfate, etc., desalting,
precipitation with organic solvent, anion-exchange
chromatography on resin such as diethylaminoethyl
(DEAE)-Sepharose, DIAION HPA-75 (a product of Mitsubishi Chemical
Industries Ltd.), or the like, cation-exchange chromatography
on resin such as S-Sepharose FF (a product of Pharmacia), or
the like, hydrophobic chromatography on resin such as butyl
Sepharose, phenyl Sepharose, or the like, gel filtration using
molecular sieve, affinity chromatography, chromatofocusing,
and electrophoresis such as isoelectric focusing, or the like.

When said polypeptide is expressed as an inclusion body
in cells, the cells are similarly recovered, disrupted and
centrifuged to obtain a precipitate fraction. From this
fraction, the inclusion body of the polypeptide is recovered
in a usual manner and solubilized with a polypeptide-denaturing
agent. The solubilized solution is then diluted with, or
dialyzed against, a solution not containing the
polypeptide-denaturing agent or a solution containing the
polypeptide-denaturing agent at such a low concentration that
denaturation of the polypeptide is not caused, whereby the
solubilized polypeptide is renatured to have its normal
higher-order structure, and its purified preparation can be
obtained by use of the same isolation and purification methods
as described above.

When said polypeptide is extracellularly secreted, the
culture is subjected to means such as centrifugation to give
a soluble fraction. From the soluble fraction, a purified
preparation of said polypeptide can be obtained in the same
manner as for isolation and purification from the cell-free
extract as described above.

Further, the polypeptide of the present invention can 
be purified in accordance with a usual method of purifying 
transferase [Methods in Enzymology, 83, 458].

Furthermore, it is possible to produce the polypeptide
of the present invention as a fusion protein with another
protein and to purify it by affinity chromatography using a
substance having affinity for the fused protein [Akio Yamakawa:
Jikken Igaku (Experimental Medicine), 13, 469-474 (1995)].

For example, according to the method of Lowe, et al. [Proc.
Natl. Acad. Sci., USA, 86, 8227 (1989), Genes Develop., 4, 1288
(1990)] or the methods described in Japanese Published
Unexamined Patent Application No. 336963/93 and WO94/23021,
the polypeptide of the present invention can be produced as
a fusion protein with protein A and can be purified by affinity
chromatography using immunoglobulin G.

Further, it is possible to produce the polypeptide of
the present invention as a fusion protein with a FLAG peptide
and to purify it by affinity chromatography using anti-FLAG
antibody [Proc. Natl. Acad. Sci., USA, 86, 8227 (1989), Genes
Develop., 4, 1288 (1990)].

Furthermore, the polypeptide can also be purified by 
affinity chromatography using an antibody against said 
polypeptide itself. 

The polypeptide of the present invention can also be
produced by an in vitro transcription-translation system in
accordance with known methods [J. Biomolecular NMR, 6, 129-134,
Science, 242, 1162-1164, J. Biochem., 110, 166-168 (1991)].

On the basis of the amino acid information on the
polypeptide obtained above, the polypeptide of the present
invention can also be produced by chemical synthetic methods
such as the Fmoc method (the fluorenylmethyloxycarbonyl
method), the tBoc method (the t-butyloxycarbonyl method), etc.
The polypeptide can also be chemically synthesized by a peptide
synthesizer from Advanced ChemTech, Perkin Elmer, Pharmacia
Biotech, Protein Technology Instrument, Synthecell-Vega,
PerSeptive, Shimadzu Corporation, etc.

The structural analysis of the purified polypeptide of
the present invention can be carried out according to the method
described in, e.g., Structural Analysis of Protein for Gene
Cloning (written by Hisashi Hirano and published by Tokyo
Kagaku Dojin, 1993).

The β1,3-galactosyltransferase activity of the
polypeptide of the present invention can be measured according
to a known measuring method [J. Biol. Chem., 258, 9893-9898
(1983); J. Biol. Chem., 262, 15649-15658 (1987); Archi. Biochem.
Biophys., 270, 630-646 (1989); Archi. Biochem. Biophys., 274,
14-25 (1989); Japanese Published Unexamined Patent
Application No. 181759/94; J. Biol. Chem., 273, 58-65 (1998);
J. Biol. Chem., 273, 433-440 (1998); J. Biol. Chem., 273,
12770-12778 (1998)].

(3) Production of sugar chains having galactose transferred
via β1,3-linkage to N-acetylglucosamine, N-acetylglucosamine
residue, glucose, or glucose residue thereof or complex
carbohydrates containing said sugar chains.

Sugar chains having galactose transferred via β1,3-linkage
to N-acetylglucosamine, N-acetylglucosamine residue,
glucose or glucose residue thereof or complex carbohydrates
containing said sugar chains can be produced by culturing in
a medium a transformant selected from the group consisting of
the transformants derived from microorganisms, animal cells,
plant cells and insect cells obtained in item (2) above, to
thereby form and accumulate said sugar chains or said complex
carbohydrates in the culture, and collecting said sugar chains
or said complex carbohydrates from the culture.

The sugar chains having galactose transferred via
β1,3-linkage to N-acetylglucosamine residue thereof include
sugar chains having sialyl-Lewis a structure, sugar chains
having sialyl-Lewis c structure, sugar chains having Lewis a
structure, sugar chains having Lewis b structure, sugar chains
having Galα1-3Galβ1-3GlcNAc structure, sugar chains having
Galα1-3(Fucα1-2)Galβ1-3GlcNAc structure and sugar chains
having GalNAcα1-3(Fucα1-2)Galβ1-3GlcNAc structure.

The culturing can be conducted in accordance with item 
(2) above. 

By producing the polypeptide of the present invention
and any recombinant glycoprotein (e.g., pharmaceutical
recombinant glycoprotein) simultaneously in the transformant
described above, sugar chains having galactose transferred via
β1,3-linkage to N-acetylglucosamine, N-acetylglucosamine
residue, glucose or glucose residue thereof can be added to
the recombinant glycoprotein.

Further, the sugar chains having galactose transferred
via β1,3-linkage to N-acetylglucosamine, N-acetylglucosamine
residue, glucose or glucose residue thereof, or complex
carbohydrates containing said sugar chains added thereto, can
be produced in accordance with the method described in item
(2) above by using the animal or plant obtained in item (2)
above.

That is, in the case of the animal, for example, a
non-human transgenic animal harboring the DNA of the present
invention is raised so that sugar chains having galactose
transferred via β1,3-linkage to N-acetylglucosamine,
N-acetylglucosamine residue, glucose or glucose residue thereof,
or complex carbohydrates containing said sugar chains added
thereto, are formed and accumulated in the animal, and said
products are recovered from the animal, whereby the sugar chains
having galactose transferred via β1,3-linkage to
N-acetylglucosamine, N-acetylglucosamine residue, glucose or
glucose residue thereof, or complex carbohydrates containing
said sugar chains added thereto, can be produced.

These products are formed and accumulated in, for example,
the milk or eggs of the animal.

In the case of the plant, for example, a transgenic plant
harboring the DNA of the present invention is cultivated so that
sugar chains having galactose transferred via β1,3-linkage to
N-acetylglucosamine, N-acetylglucosamine residue, glucose or
glucose residue thereof, or complex carbohydrates containing
said sugar chains added thereto, are formed and accumulated
in the plant, and said products are recovered from the plant,
whereby the sugar chains having galactose transferred via
β1,3-linkage to N-acetylglucosamine, N-acetylglucosamine
residue, glucose or glucose residue thereof, or complex
carbohydrates containing said sugar chains added thereto, can
be produced.

Reaction products having galactose transferred via
β1,3-linkage to N-acetylglucosamine residues present at the
non-reducing termini of sugar chains thereof or to
N-acetylglucosamine monosaccharides can be produced in an
aqueous medium by the following method using the polypeptide
of the present invention obtained by the method described in
item (2) above as the enzyme source.

That is, N-acetylglucosamine monosaccharides,
oligosaccharides having N-acetylglucosamine residue at the
non-reducing termini thereof, or complex carbohydrates having
N-acetylglucosamine residue at the non-reducing termini of
sugar chains thereof, are used as the acceptor substrate while
the polypeptide of the present invention obtained by the method
described in item (2) above is used as the enzyme source, and
reaction products having galactose transferred via β1,3-linkage
to N-acetylglucosamine or N-acetylglucosamine residue
of the acceptor substrate can be produced by allowing said
acceptor substrate, said enzyme source and
uridine-5'-diphosphate galactose (UDP-Gal) to be present in an
aqueous medium to thereby form and accumulate said reaction
products in said aqueous medium, and collecting said reaction
products from said aqueous medium.

Reaction products having galactose transferred via
β1,3-linkage to glucose residues present at the non-reducing
termini of sugar chains thereof or to glucose monosaccharides
can be produced in an aqueous medium by the following method
using the polypeptide of the present invention obtained by the
method described in item (2) above as the enzyme source.

That is, glucose monomers, oligosaccharides having
glucose residue at the non-reducing termini thereof, or complex
carbohydrates having glucose residue at the non-reducing
termini of sugar chains thereof are used as the acceptor
substrate while the polypeptide of the present invention
obtained by the method described in item (2) above is used as
the enzyme source, and reaction products having galactose
transferred via β1,3-linkage to glucose monomers or glucose
residues of the acceptor substrate can be produced by allowing
said acceptor substrate, said enzyme source and UDP-Gal to be
present in an aqueous medium to thereby form and accumulate
said reaction products in said aqueous medium, and collecting
said reaction products from said aqueous medium.

The enzyme source is used at a concentration of 0.1 mU/l
to 10,000 U/l, preferably 1 mU/l to 1,000 U/l, where 1 U refers
to the activity by which 1 µmol of lacto-N-tetraose
(Galβ1-3GlcNAcβ1-3Galβ1-4Glc) is formed in 1 minute at 37°C from
agalacto lacto-N-neotetraose (GlcNAcβ1-3Galβ1-4Glc) as the
substrate.
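
The unit definition above lends itself to a quick back-of-the-envelope calculation. The following Python sketch is illustrative only: the function names are hypothetical, and the product estimate is merely an upper bound that assumes the assay conditions of the unit definition (37°C, non-limiting substrate) hold throughout the reaction.

```python
# Illustrative helpers; the names are hypothetical and not part of the source.
ENZYME_RANGE_U_PER_L = (1e-4, 1e4)      # 0.1 mU/l to 10,000 U/l
PREFERRED_RANGE_U_PER_L = (1e-3, 1e3)   # 1 mU/l to 1,000 U/l


def units_in_reaction(conc_u_per_l: float, volume_l: float) -> float:
    """Total enzyme units present in a reaction of the given volume."""
    low, high = ENZYME_RANGE_U_PER_L
    if not low <= conc_u_per_l <= high:
        raise ValueError("enzyme concentration outside 0.1 mU/l to 10,000 U/l")
    return conc_u_per_l * volume_l


def max_product_umol(units: float, minutes: float) -> float:
    """Upper bound on product: 1 U forms 1 umol of product per minute."""
    return units * minutes


if __name__ == "__main__":
    units = units_in_reaction(100.0, 0.01)   # 100 U/l in a 10 ml reaction
    print(units, max_product_umol(units, 60.0))
```

For instance, 100 U/l in a 10 ml reaction corresponds to about 1 U, which could form at most roughly 60 µmol of product in one hour under these assumptions.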

The aqueous medium includes water, buffers such as
phosphates, carbonates, acetates, borates, citrates and Tris,
alcohols such as methanol and ethanol, esters such as ethyl
acetate, ketones such as acetone, and amides such as acetamide.
Further, the culture solution of the microorganism used as the
enzyme source can be used as the aqueous medium. A
transformant's culture solution obtained by the culturing
described in item (2) above, or milk obtained from the non-human
transgenic animal described in item (2) above, can also be used
as the aqueous medium.

A surfactant or an organic solvent may be added as 
necessary to the aqueous medium. 

The surfactant may be any one capable of promoting
formation of Gal-containing sugars, and examples of such
surfactants include nonionic surfactants such as
polyoxyethylene octadecylamine (e.g., Nimean S-215, a product
of Nippon Oil and Fats Co., Ltd.), cationic surfactants such
as cetyl trimethyl ammonium bromide and alkyl dimethyl benzyl
ammonium chloride (e.g., Cation F2-40E, a product of Nippon
Oil and Fats Co., Ltd.), anionic surfactants such as lauroyl
sarcosinate, and tertiary amines such as alkyl dimethyl amine
(e.g., tertiary amine FB, a product of Nippon Oil and Fats Co.,
Ltd.), and these can be used alone or in combination.

The surfactant is usually used at a concentration of 0.1
to 50 g/l.

The organic solvent includes xylene, toluene, fatty
alcohol, acetone and ethyl acetate, and is usually used at a
concentration of 0.1 to 50 ml/l.

As UDP-Gal, a commercial product, a reaction solution
formed by the activity of a microorganism, etc., or a purified
product from the reaction solution can be used. The UDP-Gal
can be used at a concentration of 0.1 to 500 mmol/l.

The oligosaccharides having N-acetylglucosamine
residue at the non-reducing termini thereof include
GlcNAcβ1-3Galβ1-4Glc, GlcNAcβ1-3Galβ1-4GlcNAc,
GlcNAcβ1-3(GlcNAcβ1-6)Galβ1-4Glc,
GlcNAcβ1-3(GlcNAcβ1-6)Galβ1-4GlcNAc, GlcNAcβ1-3GalNAc,
GlcNAcβ1-6GalNAc, and
oligosaccharides having one of said oligosaccharide
structures at the non-reducing termini of sugar chains thereof.

The complex carbohydrates having N-acetylglucosamine
residue at the non-reducing termini of sugar chains thereof
include complex carbohydrates having one of said
oligosaccharide structures at the non-reducing termini of
sugar chains thereof, or complex carbohydrates containing
asialogalacto complex N-linked sugar chains.

The acceptor substrate can be used at a concentration 
of 0.01 to 500 mmol/l.

In said formation reaction, inorganic salts such as MnCl2,
etc., β-mercaptoethanol, polyethylene glycol, etc. can be
added as necessary.

The formation reaction is carried out at 20 to 50°C for
1 to 96 hours in an aqueous medium, pH 5 to 10, preferably pH
6 to 8.
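
The numerical ranges stated above for the formation reaction can be collected into a simple checklist. The sketch below is one hypothetical way to do this in Python; the class and field names are assumptions made for illustration, and only the numeric limits come from the text.

```python
from dataclasses import dataclass


@dataclass
class ReactionSetup:
    """Hypothetical container for one planned formation reaction."""
    temperature_c: float
    hours: float
    ph: float
    udp_gal_mmol_per_l: float
    acceptor_mmol_per_l: float


# Limits as stated in the description (inclusive).
RANGES = {
    "temperature_c": (20.0, 50.0),
    "hours": (1.0, 96.0),
    "ph": (5.0, 10.0),
    "udp_gal_mmol_per_l": (0.1, 500.0),
    "acceptor_mmol_per_l": (0.01, 500.0),
}


def out_of_range(setup: ReactionSetup) -> list:
    """Return the names of all parameters outside the stated ranges."""
    problems = []
    for name, (low, high) in RANGES.items():
        if not low <= getattr(setup, name) <= high:
            problems.append(name)
    return problems


if __name__ == "__main__":
    print(out_of_range(ReactionSetup(37.0, 16.0, 7.0, 10.0, 5.0)))
    print(out_of_range(ReactionSetup(60.0, 16.0, 4.0, 10.0, 5.0)))
```

A setup at 37°C and pH 7 passes with an empty list, while one at 60°C and pH 4 is flagged for temperature and pH.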

From the sugar chains or complex carbohydrates produced
by the method described above, a part of the sugar chains can
be cleaved off by conventional enzymatic or chemical means
["Zoku Seikagaku Jikken Koza" (Sequel to Lecture of Experiments
in Biochemistry), Vol. 4, "Fukugotoshitsu Kenkyuho I & II"
(Method for Study of Complex Carbohydrates), ed. by Japanese
Biochemistry Society and published by Tokyo Kagaku Dojin K.
K. (1986); N. Taniguchi, A. Suzuki, K. Furukawa, K. Sugawara:
Experimental Protocol in Glycobiology, published by Shujunsha
(1996)].

(4) Preparation of an antibody recognizing the polypeptide of
the present invention

(i) Preparation of a polyclonal antibody 

A polyclonal antibody can be prepared by using a purified
product of the full-length polypeptide or a partial fragment
of the polypeptide obtained by the method described in item
(2) above as the antigen and administering the antigen to an
animal.

The animals to which the antigen is administered include 
rabbits, goats, rats, mice and hamsters. 

The preferred dosage of the antigen is 50 to 100 µg per animal.

When a peptide is used as the antigen, it is preferred
to use the peptide as the antigen after binding it covalently
to a carrier protein such as keyhole limpet hemocyanin, bovine
thyroglobulin, or the like. The peptide used as the antigen
can be synthesized by a peptide synthesizer.

Administration of the antigen is carried out 3 to 10 times
at one- to two-week intervals after the first administration.
A blood sample is collected from the venous plexus of the
fundus oculi on the third to seventh day after each
administration, and the serum is examined for reactivity to the
antigen used for immunization, for example, by enzyme immunoassay
[Enzyme-linked Immunosorbent Assay (ELISA), published by
Igaku Shoin (1976); Antibodies - A Laboratory Manual, Cold
Spring Harbor Laboratory (1988)].

A polyclonal antibody can be prepared by obtaining the
serum from a non-human mammal whose serum shows a sufficient
antibody titer against the antigen used for immunization, and
isolating and purifying the antibody from the serum.

With regard to the method for isolation and purification
of the polyclonal antibody, centrifugation, the salting-out
method with 40 to 50% saturated ammonium sulfate, the caprylic
acid precipitation method [Antibodies, A Laboratory Manual, Cold
Spring Harbor Laboratory (1988)], or chromatographic methods
using a DEAE-Sepharose column, an anion exchange column, a
protein A or G column, a gel filtration column, and the like,
may be employed alone or in combination.

(ii) Preparation of a monoclonal antibody

(a) Preparation of antibody-producing cells 

The non-human mammal whose serum shows adequate antibody
titer against a partial fragment of the polypeptide of the
present invention used in immunization is used as a source of
antibody-producing cells.

On the third to seventh day after the final
administration of the antigen to a rat with said antibody titer,
the spleen is excised from the rat. The spleen is cut into
pieces in MEM medium (a product of Nissui Pharmaceuticals, Co.)
and the pieces are then loosened with tweezers, followed by
centrifugation at 1,200 rpm for 5 minutes, after which the
resulting supernatant is discarded.

The spleen cells in the resulting precipitated fraction
are treated with a Tris-ammonium chloride buffer (pH 7.65) for
1 to 2 minutes to remove erythrocytes, followed by washing 3
times with MEM medium to give spleen cells as
antibody-producing cells.

(b) Preparation of myeloma cells 

As myeloma cells, cell lines obtained from mice or rats
are used. For example, the 8-azaguanine-resistant mouse
(BALB/c-derived) myeloma cell lines P3-X63Ag8-U1 (hereinafter
abbreviated to P3-U1) [Curr. Topics Microbiol. Immunol., 81,
1 (1978); Europ. J. Immunol., 6, 511 (1976)], SP2/0-Ag14 (SP-2)
[Nature, 276, 269 (1978)], P3-X63-Ag8653 (653) [J. Immunol.,
123, 1548 (1979)] and P3-X63-Ag8 (X63) [Nature, 256, 495 (1975)]
can be used. These cell lines are subcultured in
8-azaguanine medium [a medium prepared by adding
8-azaguanine (15 ng/ml) to a medium (referred to hereinafter
as normal medium) having glutamine (1.5 mmol/l),
2-mercaptoethanol (5×10⁻⁵ mol/l), gentamicin (10 µg/ml) and fetal
calf serum (FCS) (a product of CSL Ltd.; 10%) added to RPMI-1640
medium], and 3 to 4 days before cell fusion, they are cultured
in the normal medium, and at least 2×10⁷ cells are used for
fusion.

(c) Preparation of hybridoma 
The antibody-producing cells obtained in item (a) above
and myeloma cells obtained in item (b) above are washed well
with MEM medium or PBS (1.83 g of disodium phosphate, 0.21 g
of monopotassium phosphate, 7.65 g of common salt, 1 L of
distilled water, pH 7.2) and mixed such that the ratio of the
antibody-producing cells/myeloma cells ranges from 5/1 to 10/1,
and the mixture is centrifuged at 1,200 rpm for 5 minutes and
the supernatant is discarded.

The cell pellet obtained as the precipitated fraction
is well loosened, and a mixture containing 2 g of polyethylene
glycol-1000 (PEG-1000), 2 ml of MEM and 0.7 ml of dimethyl
sulfoxide (DMSO) is added to the cells in a volume of 0.2 to
1 ml per 10⁸ antibody-producing cells with stirring at 37°C, and
1 to 2 ml of MEM medium is added thereto several times at 1-
to 2-minute intervals.
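
The cell ratio and the PEG mixture volume given above scale with the number of antibody-producing cells. As a minimal sketch (function names are hypothetical; only the 5/1 to 10/1 ratio and the 0.2 to 1 ml per 10⁸ cells figure come from the text), the quantities can be worked out as follows:

```python
def myeloma_cells_needed(antibody_cells: float, ratio: float) -> float:
    """Myeloma cells for a given antibody-producing/myeloma cell ratio."""
    if not 5.0 <= ratio <= 10.0:
        raise ValueError("ratio should lie between 5/1 and 10/1")
    return antibody_cells / ratio


def peg_mixture_volume_ml(antibody_cells: float) -> tuple:
    """(min, max) volume of PEG/MEM/DMSO mixture: 0.2 to 1 ml per 1e8 cells."""
    per_1e8 = antibody_cells / 1e8
    return (0.2 * per_1e8, 1.0 * per_1e8)


if __name__ == "__main__":
    print(myeloma_cells_needed(1e8, 5.0))   # myeloma cells at a 5/1 ratio
    print(peg_mixture_volume_ml(2e8))       # mixture volume range in ml
```

With 10⁸ antibody-producing cells at a 5/1 ratio, this gives 2×10⁷ myeloma cells, which matches the minimum number of myeloma cells mentioned in item (b).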

After addition, MEM medium is added to adjust the total
volume to 50 ml. The solution thus prepared is centrifuged
at 900 rpm for 5 minutes, and the supernatant is discarded.

The cells obtained in the precipitated fraction are
gently loosened and suspended by pipetting in 100 ml of HAT
medium [the medium prepared by adding hypoxanthine (10⁻⁴ mol/l),
thymidine (1.5×10⁻⁵ mol/l) and aminopterin (4×10⁻⁷ mol/l) to the
normal medium].

The suspension is dispensed into each well of a 96-well culture
plate (100 µl/well) and cultured at 37°C in a 5% CO2 incubator
for 7 to 14 days.
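
As a rough planning aid, the number of plates follows from the suspension volume and the per-well volume. The helper below is a hypothetical sketch that assumes the whole 100 ml suspension is plated with no dead volume.

```python
import math


def plates_needed(suspension_ml: float = 100.0,
                  ul_per_well: float = 100.0,
                  wells_per_plate: int = 96) -> tuple:
    """Return (wells, plates) needed to dispense the whole suspension."""
    wells = math.ceil(suspension_ml * 1000.0 / ul_per_well)
    return wells, math.ceil(wells / wells_per_plate)


if __name__ == "__main__":
    print(plates_needed())
```

Under these assumptions, 100 ml at 100 µl/well fills 1,000 wells, that is, eleven 96-well plates with the last one only partly used.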

After culturing, an aliquot of the supernatant is sampled
and a hybridoma reacting specifically with a partial fragment
of the polypeptide of the present invention is selected by
enzyme immunoassays described in, e.g., "Antibodies"
[Antibodies, A Laboratory Manual, Cold Spring Harbor
Laboratory, Chapter 14 (1988)].

Specifically, enzyme immunoassays are conducted as
follows:

An appropriate plate is coated with a partial fragment
of the polypeptide of the present invention, which is used as
an antigen for immunization, followed by reaction with a
culture supernatant of the hybridoma or with the purified
antibody obtained in (d) below as a first antibody and then
with anti-rat or anti-mouse immunoglobulin antibody as a second
antibody labeled with biotin, an enzyme, a chemiluminescent
substance or a radioisotope. Then, reaction depending on the
labeling substance is conducted, and a hybridoma reacting
specifically with the polypeptide of the present invention is
selected as a hybridoma producing the monoclonal antibody of
the present invention.

Using the hybridoma, cloning is repeated twice by
limiting dilution [for first dilution, HT culture medium
(aminopterin-free HAT medium) is used; for second dilution,
the normal medium is used]. A hybridoma showing a stable and
strong antibody titer is selected as the hybridoma producing
an antibody against the polypeptide of the present invention.

(d) Preparation of a monoclonal antibody

The hybridoma cells producing a monoclonal antibody
against the polypeptide of the present invention, obtained in
item (c) above, are injected at a dose of 5×10⁶ to 20×10⁶
cells/animal into the abdomens of 8- to 10-week-old mice or nude
mice treated with 0.5 ml of Pristane [animals raised for 2 weeks
after intraperitoneal administration of
2,6,10,14-tetramethylpentadecane (Pristane)]. The hybridoma
forms an ascites tumor in 10 to 21 days.

From the mouse with the ascites tumor, the ascites is
collected and centrifuged at 3,000 rpm for 5 minutes to remove
solid matter from the fluid.

From the resulting supernatant, the monoclonal antibody
can be purified and obtained according to the same method as
used for the polyclonal antibody.

The subclass of the antibody is determined using a mouse
monoclonal antibody typing kit or a rat monoclonal antibody
typing kit. The class of the antibody means the isotype of the
antibody, for example, IgG, IgA, IgM, IgD and IgE in human.
The subclass of the antibody means the isotype within the
class, for example, IgG1, IgG2a, IgG2b and IgG3 in mouse, and
IgG1, IgG2, IgG3 and IgG4 in human.

The amount of protein of the antibody is calculated by 
the Lowry method or from its absorbance at 280 nm. 
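
For the absorbance-based estimate, a minimal sketch using the Beer-Lambert relation is shown below. The extinction coefficient of 1.4 (for a 1 mg/ml solution in a 1 cm cuvette) is a commonly quoted approximation for IgG and is an assumption here, not a value given in this description; the function name is likewise hypothetical.

```python
def protein_mg_per_ml(a280: float,
                      ext_coeff_ml_per_mg_cm: float = 1.4,  # assumed IgG value
                      path_cm: float = 1.0,
                      dilution_factor: float = 1.0) -> float:
    """Estimate antibody concentration from absorbance at 280 nm."""
    if a280 < 0:
        raise ValueError("absorbance must be non-negative")
    return a280 / (ext_coeff_ml_per_mg_cm * path_cm) * dilution_factor


if __name__ == "__main__":
    # A 10-fold diluted sample reading A280 = 0.7
    print(protein_mg_per_ml(0.7, dilution_factor=10.0))
```

The coefficient should be replaced by a value appropriate to the actual antibody and species if one is known.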

(5) Application of the DNA or oligonucleotide of the present
invention to treatment, diagnosis, etc. of diseases

The DNA of the present invention can be applied to
treatment of diseases, such as inhibition of cancer metastasis,
etc., by use of antisense RNA/DNA technology [Bioscience and
Industry, 50, 322 (1992); "Kagaku" (Chemistry), 46, 681 (1991);
Biotechnology, 9, 358 (1992); Trends in Biotechnology, 10, 87
(1992); Trends in Biotechnology, 10, 152 (1992); "Saibo Kogaku"
(Cell Engineering), 16, 1463 (1997)] or by use of triple helix
technology [Trends in Biotechnology, 10, 132 (1992)], as well
as to diagnosis of such diseases by use of Northern
hybridization or PCR techniques.

For example, production of the polypeptide of the present
invention can be inhibited by administering the DNA or
oligonucleotide of the present invention described in item (1)
above or its derivative.

That is, the DNA or oligonucleotide of the present
invention or a derivative thereof can be used for inhibition
of transcription of DNA coding for the polypeptide of the
present invention or for inhibition of translation of mRNA
coding for the polypeptide of the present invention.

Furthermore, expression level of DNA coding for the 
polypeptide of the present invention can be determined by 
Northern hybridization or PCR using the DNA of the present 
invention or the above oligonucleotide prepared from the DNA. 

A promoter region of said gene can be obtained by using
the DNA of the present invention as a probe in a known method
["Shin Saibo Kogaku Jikken Protocol" (New Experimental
Protocol in Cell Engineering), edited by Department of
Anticancer Research, Medical Research Institute, Tokyo
University and published by Shujunsha (1993)].

At present, the sequences of many human chromosomal genes
whose functions are not known are registered in a database.
Accordingly, by comparing the sequence of human cDNA coding
for the polypeptide of the present invention with the sequences
of human chromosomal genes registered in a database, the human
chromosomal gene coding for the polypeptide of the present
invention may be identified to reveal the structure of said
gene. If a chromosomal gene sequence consistent with the
sequence of said cDNA is registered, the promoter region and
exon and intron structures in the chromosomal gene coding for
the polypeptide of the present invention can be determined by
comparing the sequence of the cDNA with the sequence of the
above consistent chromosomal gene.
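
The comparison of a cDNA with a matching chromosomal sequence can be illustrated with a toy example. The sketch below is not the method of the cited protocol: it is a simplified greedy search that assumes the cDNA matches the chromosomal sequence exactly, that exons appear in order, and that each exon is at least `min_exon` bases long. Real analyses would use a spliced-alignment tool that tolerates mismatches. The sequences and function names are invented for illustration.

```python
def locate_exons(genomic: str, cdna: str, min_exon: int = 4) -> list:
    """Greedily map a cDNA onto a genomic sequence by exact matching.

    Returns a list of (start, end) genomic coordinates (end exclusive),
    one per exon, in order.
    """
    exons = []
    g_pos = 0   # search genomic sequence from here onward
    c_pos = 0   # next unmatched cDNA position
    while c_pos < len(cdna):
        match = None
        # Try the longest remaining cDNA prefix first, then shorter ones.
        for length in range(len(cdna) - c_pos, min_exon - 1, -1):
            idx = genomic.find(cdna[c_pos:c_pos + length], g_pos)
            if idx != -1:
                match = (idx, idx + length)
                break
        if match is None:
            raise ValueError("cDNA is not consistent with the genomic sequence")
        exons.append(match)
        g_pos = match[1]
        c_pos += match[1] - match[0]
    return exons


def introns_between(exons: list) -> list:
    """Genomic intervals lying between consecutive exons."""
    return [(a[1], b[0]) for a, b in zip(exons, exons[1:])]


if __name__ == "__main__":
    # Toy data: upstream + exon 1 + intron + exon 2 + downstream
    genomic = "TTTT" + "ATGGCC" + "GTAAGTCCCCAG" + "CATTACA" + "GG"
    cdna = "ATGGCC" + "CATTACA"
    exons = locate_exons(genomic, cdna)
    print(exons, introns_between(exons))
    print("upstream region:", genomic[:exons[0][0]])
```

In this toy case the sequence upstream of the first exon stands in for the region where a promoter would be sought.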

The promoter region may be any promoter region
participating in transcription of the gene coding for the
polypeptide of the present invention in mammalian cells. For
example, mention can be made of promoter regions participating
in transcription of the gene coding for the polypeptide of the
present invention in human colon cancer cells or human
pancreatic cancer cells. For example, mention can be made of
a promoter DNA having a consecutive 50- to 5000-bp sequence
in the nucleotide sequence of 1 to 5000 in the nucleotide
sequence represented by SEQ ID NO: 3. Said promoter can be
used in the screening method described below.

It is known that polymorphism and mutations occur in a
glycosyltransferase gene. With respect to
glycosyltransferase involved in determination of ABO blood
type, for example, the following 3 enzymes are formed according
to the difference in amino acid sequence based on gene
polymorphism:

α1,3-N-acetylgalactosaminyltransferase involved in
synthesis of A type antigen, α1,3-galactosyltransferase
involved in synthesis of B type antigen, and an enzyme having
no activity, which is involved in formation of O(H) type sugar
chains [Nature, 345, 229-233 (1990)].

It is also known that the activity of
α1,3-fucosyltransferase (Fuc-TIII) involved in determination of
Lewis blood type is reduced or disappears depending on the
difference in amino acid sequence based on gene polymorphism
[J. Biol. Chem., 269, 29271-29278 (1994); Blood, 82, 2915-2919
(1993); J. Biol. Chem., 269, 20987-20994 (1994); J. Biol.
Chem., 272, 21994-21998 (1997)].

It is known that the polymorphism of Fuc-TIII gene has 
a close relationship with expression of sialyl-Lewis a sugar 
chains that are cancer-related sugar chain antigens in colon 
cancers [Cancer Res., 56, 330-338 (1996); Cancer Res., 58,
512-518 (1998)].

Accordingly, it is considered that diagnosis of diseases 
or prediction of prognosis can be performed by examining the 
polymorphism of Fuc-TIII. 

Since the novel β1,3-galactosyltransferase of the
present invention is involved in the synthesis of sialyl-Lewis
a sugar chains or sialyl-Lewis c sugar chains in colon cancers
or pancreatic cancers, diagnosis of colon cancers and
pancreatic cancers or prediction of prognosis can be performed
by examining the polymorphism of this gene.

Further, diagnosis of other diseases can also be
performed by examining the relationship of the polymorphism
of this gene with organs (stomach, jejunum, colon, pancreas,
etc.) expressing this gene.

Analysis of the polymorphism of this gene can be
performed using the sequence information on this gene.
Specifically, the polymorphism of the gene can be analyzed by
Southern blotting, direct sequencing, PCR, a DNA chip method,
etc. [Rinsho Kensa (Clinical Examination), 42, 1507-1517
(1998); Rinsho Kensa, 42, 1565-1570 (1998)].
(6) Use of the polypeptide of the present invention

(a) Use thereof for preparation of the antibody of the present
invention

The polypeptide of the present invention can be used to
prepare the antibody of the present invention by the method
described in item (4) above.
(b) Use of the polypeptide of the present invention for
production of sugar chains and complex carbohydrates

The polypeptide of the present invention can be used to
produce sugar chains having galactose transferred via
β1,3-linkage to N-acetylglucosamine, N-acetylglucosamine
residue, glucose or glucose residue thereof, or complex
carbohydrates having said sugar chains added thereto.

(c) Use thereof for screening a substance participating in the
activity of the polypeptide of the present invention

The polypeptide of the present invention can be used for
screening a compound enhancing or inhibiting the activity of
said polypeptide by the method (a) described in item (8) below.
(7) Use of the antibody of the present invention
(a) Immunological detection and quantification of the
polypeptide of the present invention by the antibody of the
present invention

The method for immunological detection of the
polypeptide of the present invention includes ELISA using a
microtiter plate, fluorescence antibody technique, Western
blotting, immunohistostaining, etc.

The immunological quantification method includes the
sandwich ELISA using two monoclonal antibodies which react with
different epitopes of the polypeptide of the present invention
in a liquid phase, and the radioimmunoassay using the
polypeptide of the present invention labeled with
radioisotopes such as 125I and the antibody recognizing the
polypeptide of the present invention.

The detection method or quantification method described 
above can be applied to diagnosis of colon cancers, pancreatic 
cancers, etc. 

(b) Pharmaceutical preparation containing the antibody of the
present invention

The antibody of the present invention can be used as an
agent for treating diseases such as colon cancers, pancreatic
cancers, etc.

As the pharmaceutical preparation comprising the
antibody of the present invention, the antibody can be
administered alone as a therapeutic agent, but usually it is
mixed with one or more pharmacologically acceptable carriers
and provided as a pharmaceutical preparation produced by a
well-known method in the technical field of pharmaceutical
manufacturing.

The administration route is preferably the most
effective route for treatment, and includes oral
administration and parenteral administration such as
intra-oral cavity administration, intratracheal
administration, intrarectal administration, subcutaneous
administration, intramuscular administration and intravenous
administration. The administration form includes sprays,
capsules, tablets, granules, syrups, emulsions, suppositories,
injections, ointments and tapes.

The pharmaceutical preparation suitable for oral
administration includes emulsions, syrups, capsules, tablets,
powders and granules. For example, liquid preparations such
as emulsions and syrups can be produced by using water,
saccharides such as sucrose, sorbitol and fructose, glycols
such as polyethylene glycol and propylene glycol, oils such
as sesame oil, olive oil and soybean oil, preservatives such
as p-hydroxybenzoate and flavors such as strawberry flavor and
peppermint as additives. The capsules, tablets, powders and
granules can be produced by using excipients such as lactose,
glucose, sucrose and mannitol, disintegrating agents such as
starch and sodium alginate, lubricants such as magnesium
stearate and talc, binders such as polyvinyl alcohol,
hydroxypropyl cellulose and gelatin, surfactants such as fatty
acid esters and plasticizers such as glycerin as additives.

The pharmaceutical preparation suitable for parenteral
administration includes injections, suppositories and sprays.
For example, the injections are prepared by using carriers such
as a salt solution, glucose solution or a mixture of the two.
The suppositories are produced by using carriers such as cocoa
butter, hydrogenated fats and carboxylic acids. The sprays
are prepared from said compound alone or by using carriers for
dispersing said compound as fine particles to facilitate
absorption thereof without stimulating the oral cavity or
tracheal mucous membrane of a recipient. Specific examples
of such carriers include lactose and glycerin. Depending on
said compound and the properties of the carriers used,
preparations such as aerosols and dry powders can be
manufactured. In these parenteral preparations, the
ingredients exemplified as additives in the oral preparations
can also be added.

The dose and administration frequency are varied
depending on the desired therapeutic effect, administration
method, period of treatment, age, body weight, etc., but the
dose is usually 10 µg/kg to 8 mg/kg per day in an adult.
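
Purely as an arithmetic illustration of the stated range (not dosing guidance), the per-kilogram figures can be converted to a daily total for a given body weight. The function name and the 60 kg example weight are assumptions.

```python
def daily_dose_range_mg(body_weight_kg: float) -> tuple:
    """(min, max) daily dose in mg for the stated 10 ug/kg to 8 mg/kg range."""
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    return (0.010 * body_weight_kg, 8.0 * body_weight_kg)


if __name__ == "__main__":
    print(daily_dose_range_mg(60.0))
```

For a 60 kg adult this corresponds to roughly 0.6 mg to 480 mg per day.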
(8) Application to screening methods 

The novel β1,3-galactosyltransferase polypeptide of the
present invention is involved in the synthesis of type 1 sugar
chains such as sialyl-Lewis a sugar chain, sialyl-Lewis c sugar
chain, Lewis a sugar chain and Lewis b sugar chain in cancer
cells in the digestive system, such as colon cancer cells and
pancreatic cancer cells, and thus a compound enhancing or
inhibiting the activity of said polypeptide can be used to
increase or decrease the amount of the type 1 sugar chains
synthesized in the cells.

Further, a compound promoting or suppressing the
transcription process for the gene coding for said polypeptide
or the translation process from the transcript to the protein
can regulate expression of said polypeptide, to regulate the
amount of type 1 sugar chains synthesized in the cells.

The compound suppressing the amount of type 1 sugar
chains synthesized is considered to be useful for suppression
of cancer metastasis. On the other hand, the compound
increasing the amount of type 1 sugar chains synthesized is
considered to be useful for synthesis of type 1 sugar chains.

The compounds described above can be obtained by the
following methods (a) to (e):
(a) The novel polypeptide of the present invention having
β1,3-galactosyltransferase activity prepared by the method
described in item (2) above (purified product, a cell extract
or a culture supernatant from a transformant expressing said
polypeptide) is used as the enzyme, and in the presence of a
test sample, its β1,3-galactosyltransferase activity is
measured according to known methods [J. Biol. Chem., 258,
9893-9898 (1983), J. Biol. Chem., 262, 15649-15658 (1987),
Arch. Biochem. Biophys., 270, 630-646 (1989), Arch. Biochem.
Biophys., 274, 14-25 (1989), Japanese Published Unexamined
Patent Application No. 181759/94, J. Biol. Chem., 273, 58-65
(1998), J. Biol. Chem., 273, 433-440 (1998), J. Biol. Chem.,
273, 12770-12778 (1998)], whereby a test compound having the
activity of increasing or decreasing the β1,3-
galactosyltransferase activity is selected and obtained.

(b) Cells expressing the polypeptide of the present invention
or the transformant described in item (2) above are cultured
for 2 hours to 1 week in the presence of a test sample by the
culturing method described in item (2) above, and the amount
of type 1 sugar chains such as sialyl-Lewis a sugar chain,
sialyl-Lewis c sugar chain, Lewis a sugar chain and Lewis b
sugar chain on the cell surface is determined using an antibody
against each sugar chain, whereby a test compound having the
activity of increasing or decreasing the amount of said sugar
chains is selected and obtained.

The determination method using the antibody includes,
e.g., detection methods such as ELISA using a microtiter plate,
fluorescence antibody technique, Western blotting and
immunohistostaining.

(c) Cells expressing the polypeptide of the present invention
are cultured for 2 hours to 1 week in the presence of a test
sample by the culturing method described in item (2) above,
and then the amount of said polypeptide in the cells is
determined using the antibody of the present invention
described in item (4) above, whereby a test compound having
the activity of increasing or decreasing the amount of said
polypeptide is selected and obtained.

The determination method using the antibody of the
present invention includes, e.g., detection methods such as
ELISA using a microtiter plate, fluorescence antibody
techniques, Western blotting and immunohistostaining.

(d) Cells expressing the polypeptide of the present invention
are cultured for 2 hours to 1 week in the presence of a test
sample by the culturing method described in item (2) above,
and then the amount of transcripts produced from the gene coding
for said polypeptide in the cells is determined by the method
described in item (5) above such as Northern hybridization,
PCR or the like, whereby a test compound having the activity
of increasing or decreasing the amount of the transcripts is
selected and obtained.

(e) A plasmid carrying DNA in which a reporter gene is
ligated downstream of a promoter obtained in item (4) above
is prepared by a known method, and the plasmid is introduced
into the animal cells described in item (2) above by the method
described in item (2) above, whereby a transformant is obtained.
The transformant is cultured for 2 hours to 1 week in the
presence of a test sample by the culturing method described
in item (2) above, and the expression level of the reporter
gene in the cells is determined by a known method ["Shin Saibo
Kogaku Jikken Protocol" (New Experimental Protocol in Cell
Engineering), edited by Department of Anticancer Research,
Medical Research Institute, Tokyo University and published by
Shujunsha (1993); Biotechniques, 20, 914 (1996); J.
Antibiotics, 49, 453 (1996); Trends in Biochemical Sciences,
20, 448 (1995); "Saibo Kogaku" (Cell Engineering), 16, 581
(1997)], whereby a test compound having the activity of
increasing or decreasing the expression level is selected and
obtained.

Examples of reporter genes include chloramphenicol
acetyltransferase gene, β-glucuronidase gene, β-
galactosidase gene, β-lactamase gene, luciferase gene,
aequorin gene and green fluorescent protein (GFP) gene.
(9) Creation of knockout non-human animals

Using a vector containing the DNA of the present
invention, mutant clones in which the DNA coding for the
polypeptide of the present invention on the chromosome is
inactivated or replaced by an arbitrary sequence by known
homologous recombination techniques [e.g., Nature, 326, 6110,
295 (1987), Cell, 51, 3, 503 (1987), etc.] can be produced from
embryonic stem cells in objective non-human animals such as
cattle, sheep, goats, pigs, horses, chickens, mice, etc. [e.g.,
Nature, 350, 6315, 243 (1991)].

The embryonic stem cell clones thus produced and
blastocysts of fertilized eggs of non-human animals can be used
for producing chimeras composed of embryonic stem cell clones
and normal cells by techniques such as the injection chimera
method, the aggregation chimera method, etc.

By crossing said chimeras with normal individuals,
individuals in which the DNA coding for the polypeptide of the
present invention on the chromosome in cells in the whole body
is arbitrarily mutated can be obtained, and by crossing said
individuals, homozygotes (knockout non-human animals) having
the mutation on both homologous chromosomes can be obtained.

In this manner, the DNA coding for the polypeptide of
the present invention on the chromosome in the animals can be
mutated in an arbitrary position. For example, the
translational region of the DNA coding for the polypeptide of
the present invention on the chromosome can be mutated by base
substitution, deletion, insertion, etc. to alter the activity
of its product.

Further, by similar mutation of its expression-
regulating region, the degree of expression, stage, tissue
specificity, etc. can also be modified. Furthermore, by
combination with the Cre-loxP system, the expression stage,
expression site, expression level, etc. can also be regulated
to a high degree.

By way of example, it is known that by using a promoter
expressed in a specific region of the brain, an objective
gene is deleted in only that region [Cell, 87, 7, 1317 (1996)],
or by using adenovirus expressing Cre, an objective gene is
deleted in an organ-specific manner at a desired stage [Science,
278, 5335 (1997)].

It is, therefore, possible to create knockout non-human
animals in which expression of the DNA coding for the
polypeptide of the present invention on the chromosome can be
regulated at an arbitrary stage or in a specific organ as
described above, or in which the translational region or
expression-regulating region contains an arbitrary insertion,
deletion or substitution.

In such knockout non-human animals, various diseases 
attributable to the polypeptide of the present invention can 
be induced at an arbitrary stage, at an arbitrary degree or 
in an arbitrary site. 

Accordingly, the knockout non-human animal of the
present invention can serve as a very useful animal model in
treating or preventing various diseases attributable to the
polypeptide of the present invention. In particular, the
animal is very useful as a model for evaluation of agents for
treating or preventing the diseases, functional foods, health
foods, etc.

Brief Description of Drawings 

Fig. 1, A shows the results of measurement of the
expression levels of type 1 sugar chains (sialyl-Lewis a sugar
chain, Lewis a sugar chain, Lewis b sugar chain) in a wide
variety of human cancer cell lines. Each kind of cell was
stained with a fluorescent antibody using anti-sialyl-Lewis
a sugar chain antibody (19-9), anti-Lewis a sugar chain
antibody (7LE) or anti-Lewis b sugar chain antibody
(Neokokusai) and then analyzed by FACS. The reactivity with
each antibody is shown as +++, ++, +, ± and -, in the order
of decreasing reactivity. - means the absence of
reactivity with the antibody. NT means that the analysis was
not conducted.

Fig. 1, B shows the results of determination of the amount
of transcripts of human β3Gal-T1, human β3Gal-T2, human
β3Gal-T3 and human β3Gal-T4 in various human cancer cell lines
by the quantitative PCR method. The amounts of the respective
β1,3-galactosyltransferase gene transcripts in various cell
lines are expressed as values relative to the amount (= 1000)
of the β-actin transcript considered to be expressed at a
similar degree in any cells.

Fig. 2 shows the process of construction of plasmid
pBS-3GT5.

Fig. 3, A shows the results of indirect fluorescent
antibody staining with anti-sialyl-Lewis c sugar chain
antibody (DU-PAN-2) and subsequent FACS analysis of Namalwa
cells [Namalwa (mock)] having the control plasmid (pAMo)
introduced therein, or Namalwa cells (Namalwa-3GT5) having
human β3Gal-T5 expression plasmid (pAMo-3GT5) introduced
therein. The shaded histogram shows the result of analysis
using A-PBS in place of DU-PAN-2.

Fig. 3, B shows the results of indirect fluorescent
antibody staining with anti-sialyl-Lewis a sugar chain
antibody (19-9), anti-sialyl-Lewis c sugar chain antibody
(DU-PAN-2), anti-Lewis a sugar chain antibody (7LE) or
anti-Lewis b sugar chain antibody (Neokokusai) and subsequent
FACS analysis of HCT-15 cells [HCT15 (mock)] having the control
plasmid (pAMo) introduced therein, or HCT-15 cells (HCT-3GT5H)
having human β3Gal-T5 expression plasmid (pAMo-3GT5)
introduced therein. The shaded histogram shows the result of
analysis using A-PBS in place of DU-PAN-2.

Fig. 4, A shows the results of quantification of human
β3Gal-T5 transcripts in various human cancer cell lines by the
quantitative PCR method. The amounts of the human β3Gal-T5
transcripts in various cell lines are expressed as values
relative to the amount (= 1000) of the β-actin transcript
considered to be expressed at a similar degree in any cells.

Fig. 4, B shows the results of examination of expression
of CA19-9 antigen-containing proteins in various human cancer
cell lines by Western blotting analysis.

Fig. 5 shows the results of quantification of human
β3Gal-T5 transcripts in various human tissues by the
quantitative PCR method. The amounts of the human β3Gal-T5
transcripts in various human tissues are expressed as values
relative to the amount (= 1000) of the β-actin transcript
considered to be expressed at a similar degree in any cells.

Fig. 6, A shows the structure of the human β3Gal-T5
chromosomal gene. Four exons are shown by squares and introns
by lines. An XbaI site is present in exon 2, and a BsmI site
is present in exon 3. The coding region (open reading frame)
is shown by oblique lines. The positions of primers (si-1, si-2,
si-3, si-4) used for analysis of isoforms of human β3Gal-T5
cDNA are shown by arrows.

Fig. 6, B shows the structure of isoforms of human
β3Gal-T5 cDNA. Their proportion is shown in percentage of the
amount of each isoform expressed in Colo205 cells.

Fig. 6, C shows the results of examination of the
expression level of each isoform of human β3Gal-T5 cDNA in
Colo205 cells by the RT-PCR method. After RT-PCR was conducted
with the combination of primers shown in Fig. 6, A, the reaction
product was cleaved with a restriction enzyme (XbaI or BsmI)
shown in Fig. 6, to specify the isoform. "none" means that
the restriction enzyme treatment was not conducted. The left
lane shows molecular markers (100 bp ladder).

Best Mode for Carrying Out the Invention 

Examples are shown below. Unless otherwise specified, 
the methods described in Molecular Cloning, 2nd edition were 
used as techniques in gene manipulation. 




Example 1. Measurement of the expression levels of type 1
sugar chains and known β1,3-galactosyltransferase genes in
various cell lines

By measuring the expression levels of type 1 sugar chains
(sialyl-Lewis a sugar chain, Lewis a sugar chain, Lewis b sugar
chain) and the known human β1,3-galactosyltransferase genes
in various human cancer cell lines, an attempt was made to
identify the β1,3-galactosyltransferase involved in the
synthesis of type 1 sugar chains in colon cancer cell lines
or pancreatic cancer cell lines.

Measurement of the expression levels of type 1 sugar
chains (sialyl-Lewis a sugar chain, Lewis a sugar chain, Lewis
b sugar chain) in various cell lines was conducted by
fluorescent antibody staining with anti-sialyl-Lewis a sugar
chain antibody, anti-Lewis a sugar chain antibody or anti-
Lewis b sugar chain antibody, followed by analysis by FACS (Fig.
1, A).

Quantification of transcripts from the known β1,3-
galactosyltransferase (β3Gal-T1, β3Gal-T2, β3Gal-T3,
β3Gal-T4) genes in various human cell lines was conducted using
the RT-PCR method [Proc. Natl. Acad. Sci. USA, 87, 2725 (1990),
J. Biol. Chem., 269, 14730 (1994), Japanese Published
Unexamined Patent Application No. 181759/94, J. Biol. Chem.,
273, 26729 (1998)]. The amounts of transcripts from the
respective β1,3-galactosyltransferase genes in the respective
cell lines are shown as values relative to the amount (= 1000)
of the β-actin transcript considered to be expressed at a
similar degree in any cells (Fig. 1, B).

(1) Measurement of the expression levels of type 1 sugar chains
(sialyl-Lewis a sugar chain, Lewis a sugar chain, Lewis b sugar
chain) in various cell lines

The cell lines used were colon cancer cell lines (Colo205,
Colo201, SW1116, LS180, HT29, WiDr, HCT-15, SW480, SW620),
pancreatic cancer cell lines (Capan-1, Capan-2), stomach
cancer cell lines (KATOIII, MKN45, MKN74), a lung cancer cell
line (PC-1), neuroblastoma cell lines (SK-N-MC, SK-N-SH),
lymphoma cell lines (Namalwa, Jurkat) and a prostate cancer
cell line (PC-3).

Colo205, Colo201, LS180, HT29, WiDr, HCT-15, SW480,
SW620, Capan-1, Capan-2, KATOIII, MKN45, MKN74, PC-1, SK-N-MC,
SK-N-SH, Namalwa and PC-3 were obtained from American Type
Culture Collection (ATCC). Further, SW1116 (available from
ATCC) and Jurkat (available from RIKEN GENE BANK) were obtained
from Dr. Takahashi, Aichi Cancer Center, JP.

The cells described above were cultured in a medium
suitable for the respective cells, and then the cells were
subjected to fluorescent antibody staining with anti-
sialyl-Lewis a sugar chain antibody (19-9), anti-Lewis a sugar
chain antibody (7LE) or anti-Lewis b sugar chain antibody
(Neokokusai) and analyzed by FACS.

The specific method is as follows:
About 1x10 s cells were placed in a microtube (1.5 ml,
produced by Eppendorf) and the cells were collected by
centrifugation (550xg, 7 minutes).

The cells were washed with 0.9 ml of phosphate buffer
solution containing 0.1% sodium azide (A-PBS: 8 g/l NaCl, 0.2
g/l KCl, 1.15 g/l Na2HPO4 (anhydride), 0.2 g/l KH2PO4, 0.1%
sodium azide). After 20 µl of anti-sugar chain antibody
previously diluted to about 10 µg/ml with A-PBS was added to
the washed cells, the cells were suspended and reacted at 4°C
for 1 hour.

After the reaction, the cells were washed once with 0.9
ml of A-PBS and then suspended in 20 µl of a solution previously
prepared by diluting anti-mouse IgM/IgG antibodies labeled
with fluorescein isothiocyanate (FITC) (a product of Bio-Rad)
16-fold with A-PBS, followed by reaction at 4°C for 30 minutes.

After the reaction, the cells were washed once with 0.9
ml of A-PBS, then suspended in 0.6 ml of A-PBS and analyzed
by a fluorescence activated cell sorter (EPICS Elite Flow
Cytometer manufactured by COULTER). In a control experiment,
the same analysis was conducted by use of A-PBS in place of
the anti-sugar chain antibody.

The results are shown in Fig. 1, A. It was confirmed
that in the colon cancer cell lines Colo205, Colo201 and SW1116
and the pancreatic cancer cell line Capan-2, type 1 sugar chains
are expressed in large amounts.

(2) Quantification of transcripts from known β1,3-
galactosyltransferase (β3Gal-T1, β3Gal-T2, β3Gal-T3,
β3Gal-T4) genes in various human cell lines

(a) Preparation of single-stranded cDNA derived from various 
cell lines 

From the cell lines described in item (1) above, total
RNAs were extracted by the acid guanidinium thiocyanate
phenol-chloroform method [Anal. Biochem., 162, 156-159].

Deoxyribonuclease I (a product of Life Technologies
Ltd.) (5 U/ml) was added to 6 µg each of total RNAs and reacted
at room temperature for 5 minutes. After the reaction, the
mixture was heated at 65°C for 15 minutes to inactivate the
enzyme.

From each of the resulting total RNAs, cDNA was
synthesized using an oligo (dT) primer by SUPERSCRIPT™
Preamplification System for First Strand cDNA System (a product
of Life Technologies Ltd.). The reaction was conducted in 20
µl solution, and the solution after the reaction was diluted
50-fold with water and stored at -80°C until use.
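
The dilution above fixes how much starting RNA each later PCR represents. The following is a minimal sketch of that arithmetic, assuming that the whole 6 µg of DNase-treated total RNA goes into the 20 µl reverse transcription reaction (the text does not state this explicitly) and using the 10 µl template volume given in item (c) of this Example.

```python
# Values taken from the text.
rna_input_ug = 6.0        # total RNA per sample
rt_volume_ul = 20.0       # reverse transcription reaction volume
dilution_factor = 50.0    # dilution with water after the reaction
pcr_template_ul = 10.0    # cDNA volume per PCR, from item (c)

# Assumption: all 6 µg of RNA is carried into the 20 µl reaction.
final_volume_ul = rt_volume_ul * dilution_factor
rna_equiv_ng_per_ul = rna_input_ug * 1000.0 / final_volume_ul
rna_equiv_per_pcr_ng = rna_equiv_ng_per_ul * pcr_template_ul

print(f"diluted cDNA volume: {final_volume_ul:.0f} µl")
print(f"RNA equivalent per PCR: {rna_equiv_per_pcr_ng:.0f} ng")
```

Under these assumptions each PCR receives cDNA corresponding to about 60 ng of total RNA.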

(b) Preparation of standards and internal controls 

As the standards used for preparation of a calibration
curve, linear DNAs were prepared by integrating cDNAs of human
β3Gal-T1, human β3Gal-T2, human β3Gal-T3 and human β3Gal-T4
into pUC119 or pBluescript SK(-) and cleaving the resulting
plasmids (pUC119-3GT1, pBS-3GT2, pBS-3GT3, pBS-3GT4) with
suitable restriction enzymes to take the cDNA inserts.
pUC119-3GT1 is identical with plasmid pUC119-WM1 (FERM
BP-4011) described in Japanese Published Unexamined Patent
Application No. 181759/94. The cDNAs of human β3Gal-T2, human
β3Gal-T3 and human β3Gal-T4 were obtained in the following
manner.

A fragment of each cDNA was obtained by PCR using primers
specific to the sequence of each cDNA.
By colony hybridization or plaque hybridization using
each cDNA fragment thus obtained as a probe, each cDNA was
obtained. The nucleotide sequences of human β3Gal-T2, human
β3Gal-T3 and human β3Gal-T4 were determined by DNA sequencer
model 4000L (a product of LI-COR Ltd.) or DNA sequencer 377
(a product of Perkin Elmer) using a reaction kit for each
sequencer and it was confirmed that each cDNA codes for human
β3Gal-T2, human β3Gal-T3 and human β3Gal-T4, respectively.
Each cDNA could also be obtained by PCR on the basis of its
known sequence.

As the standard for quantification of the β-actin
transcript, a linear DNA was prepared by cleaving pUC119-ACT
with restriction enzymes (HindIII and Asp718) to take the cDNA
insert [J. Biol. Chem., 269, 14730 (1994), Japanese Published
Unexamined Patent Application No. 181759/94].

As the internal controls, linear DNAs were prepared by
cleaving plasmids prepared below (pBS-3GT1d, pBS-3GT2d,
pBS-3GT3d, pBS-3GT4d) with suitable restriction enzymes to
take the cDNA inserts.

By deleting a 212-bp sequence between BanII-EcoRV in
human β3Gal-T1 cDNA in pUC119-3GT1, pUC119-3GT1d was prepared.

By deleting a 258-bp sequence between AflII-BstEII in
human β3Gal-T2 cDNA in pBS-3GT2, pBS-3GT2d was prepared.

By deleting a 183-bp sequence between StyI-StyI in human
β3Gal-T3 cDNA in pBS-3GT3, pBS-3GT3d was prepared.

By deleting a 253-bp sequence between AccIII-StyI in
human β3Gal-T4 cDNA in pBS-3GT4, pBS-3GT4d was prepared.

As the internal control for quantification of the β-
actin transcript, a linear DNA was prepared by cleaving
pUC119-ACTd with restriction enzymes (HindIII and Asp718) to
take the cDNA insert [J. Biol. Chem., 269, 14730 (1994),
Japanese Published Unexamined Patent Application No.
181759/94].

(c) Quantification of transcripts by RT-PCR 
Reaction solution [10 mmol/l Tris-HCl (pH 8.3), 50 mmol/l
KCl, 1.5 mmol/l MgCl2, 0.2 mmol/l dNTP, 0.001% (w/v) gelatin,
0.2 µmol/l gene-specific primers] (50 µl) containing 10 µl of
each tissue-derived cDNA and 10 µl (10 fg) of each internal
control plasmid described above was subjected to PCR with DNA
polymerase AmpliTaq Gold™ (a product of Perkin Elmer).
The nucleotide sequence of each gene-specific primer is
shown in Table 1. Further, the nucleotide sequences of
β3Gal-T1 gene-specific primers are represented by SEQ ID NOS:
4 and 5, the nucleotide sequences of β3Gal-T2 gene-specific
primers by SEQ ID NOS: 6 and 7, the nucleotide sequences of
β3Gal-T3 gene-specific primers by SEQ ID NOS: 8 and 9, the
nucleotide sequences of β3Gal-T4 gene-specific primers by SEQ
ID NOS: 10 and 11, and the nucleotide sequences of the β-actin
specific primers by SEQ ID NOS: 12 and 13.

Table 1. Conditions and Primers Used in Quantitative PCR

Target     Primer set*                           PCR product (bp)    Annealing
gene                                             Target  Competitor  temp. (°C)

β3Gal-T1   F  5'-TTCAGCCACCTAACAGTTGCCAGG-3'     495     283         60
           R  5'-ATACCTTCTTCGTGGCTTGGTGGAG-3'

β3Gal-T2   F  5'-TAGAAGCTAGAAGAGCTATTCGGC-3'     616     358         60
           R  5'-ACTCGCCAGTGATTGAACACAAAC-3'

β3Gal-T3   F  5'-CCCAATGCCAAGTACGTAATGAAG-3'     474     291         60
           R  5'-TGTGGTGTTCCTTAGCATGACCTG-3'

β3Gal-T4   F  5'-TTGATCCCCAACCAGGAAGCTTGC-3'     590     337         68
           R  5'-TGAGGCCACTGCTCCTCTGATACG-3'

β3Gal-T5   F  5'-ACCACCAGCAGTGCAGCGGAAAC-3'      554     410         65
           R  5'-GCCACGATCCTCCTGAAGAGGCA-3'

β-Actin    F  5'-GATATCGCCGCGCTCGTCGTCGAC-3'     789     639         60
           R  5'-CAGGAAGGAAGGCTGGAAGAGTGC-3'

* F: forward primer, R: reverse primer



By the primer sets shown above, DNA fragments of sizes
shown in target in Table 1 can be amplified from the respective
gene transcripts and the respective standards. By the primer
sets shown above, DNA fragments of sizes shown in competitor
in Table 1 can be amplified from the respective internal
controls.
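
The target and competitor sizes in Table 1 can be cross-checked against the deletions used to build the internal controls in item (b). The sketch below assumes that each deleted segment lies entirely between the two primers, so that the competitor product is the target product shortened by the deletion; the dictionary and function names are illustrative only.

```python
# Sizes in bp, taken from Table 1 and from the deletions described in item (b).
TARGET_BP = {"b3Gal-T1": 495, "b3Gal-T2": 616, "b3Gal-T3": 474, "b3Gal-T4": 590}
DELETION_BP = {"b3Gal-T1": 212, "b3Gal-T2": 258, "b3Gal-T3": 183, "b3Gal-T4": 253}
COMPETITOR_BP = {"b3Gal-T1": 283, "b3Gal-T2": 358, "b3Gal-T3": 291, "b3Gal-T4": 337}


def expected_competitor_bp(gene):
    """Competitor size if the whole deletion lies between the primers."""
    return TARGET_BP[gene] - DELETION_BP[gene]


for gene in TARGET_BP:
    expected = expected_competitor_bp(gene)
    status = "matches" if expected == COMPETITOR_BP[gene] else "differs from"
    print(f"{gene}: {expected} bp {status} Table 1")
```

For all four genes the difference between the target and competitor products equals the stated deletion, which is what allows the two products to be separated on an agarose gel.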
After heating at 95°C for 11 minutes, PCR was conducted
using 42-cycle reaction for the β1,3-galactosyltransferase
gene group and 24-cycle reaction for the β-actin, each cycle
consisting of reaction at 95°C for 1 minute, at an annealing
temperature shown in Table 1 suitable for each gene for 1 minute,
and at 72°C for 2 minutes.
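
As a rough illustration of what these programs imply in practice, the nominal run time can be computed from the initial heating step and the per-cycle hold times. This sketch ignores temperature ramp times, which depend on the thermal cycler and are not given in the text; the function name is illustrative.

```python
def program_minutes(initial_min, cycles, step_minutes):
    """Nominal run time: initial heating plus cycles times the hold times."""
    return initial_min + cycles * sum(step_minutes)


# Hold times per cycle from the text: denaturation, annealing, extension.
STEPS_MIN = (1.0, 1.0, 2.0)

galt_minutes = program_minutes(11.0, 42, STEPS_MIN)   # galactosyltransferase genes
actin_minutes = program_minutes(11.0, 24, STEPS_MIN)  # beta-actin

print(f"galactosyltransferase program: {galt_minutes:.0f} min")
print(f"beta-actin program: {actin_minutes:.0f} min")
```

The higher cycle number for the galactosyltransferase genes is consistent with their transcripts being far less abundant than the β-actin transcript.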

Each solution (10 µl) after PCR was electrophoresed on
1% agarose gel, stained by ethidium bromide and photographed.
By scanning each photograph by an NIH image system, the density
of staining of each amplified fragment was determined as the
amplification amount.

PCR was conducted similarly using 1.25 fg, 2.5 fg, 5 fg,
10 fg, 20 fg and 40 fg of standard plasmid prepared above in
place of the cell-derived cDNA, to determine the amount of the
amplified fragment, and by plotting the amount of the amplified
fragment vs. the amount of the cDNA, a calibration curve was
prepared.
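
A calibration curve of this kind can be read back by interpolation. The following is a minimal sketch, not the procedure actually used: the standard amounts are those given above, whereas the band densities are hypothetical numbers invented for illustration, and piecewise-linear interpolation is an assumed choice because the text does not say how the curve was fitted.

```python
from bisect import bisect_left

# Standard amounts (fg) from the text.
STANDARD_FG = [1.25, 2.5, 5.0, 10.0, 20.0, 40.0]
# Hypothetical band densities (arbitrary units), invented for illustration.
STANDARD_DENSITY = [12.0, 23.0, 45.0, 88.0, 170.0, 320.0]


def amount_from_density(density):
    """Interpolate the template amount (fg) for a measured band density."""
    if not STANDARD_DENSITY[0] <= density <= STANDARD_DENSITY[-1]:
        raise ValueError("density outside the calibrated range")
    i = bisect_left(STANDARD_DENSITY, density)
    if STANDARD_DENSITY[i] == density:
        return STANDARD_FG[i]
    x0, x1 = STANDARD_DENSITY[i - 1], STANDARD_DENSITY[i]
    y0, y1 = STANDARD_FG[i - 1], STANDARD_FG[i]
    return y0 + (y1 - y0) * (density - x0) / (x1 - x0)


print(amount_from_density(66.5))  # halfway between the 5 fg and 10 fg standards
```

Densities outside the range of the standards are rejected rather than extrapolated, since the curve is only defined between 1.25 fg and 40 fg.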

From this calibration curve and the amounts of the
fragments amplified from cDNAs derived from various cells, the
amounts of the cDNAs in various cells were calculated and
assumed to be the amounts of mRNAs transcribed in the various
cells, that is, the expression levels of the genes. Because
the β-actin gene is considered to be a gene expressed
universally in various cells, its expression level is
considered similar in any cells. Accordingly, a difference
in the expression level of the β-actin gene among the respective
cells is considered attributable to the difference of
efficiency of cDNA synthesis reaction, and thus when the
expression levels of the β1,3-galactosyltransferase genes
were compared, the expression level of the β-actin gene was
also taken into consideration.

The amounts of the respective β1,3-
galactosyltransferase gene transcripts in various cell lines
are shown as values relative to the amount (= 1000) of the
β-actin transcript (Fig. 1, B).
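
The normalization to β-actin described above amounts to a simple ratio scaled to 1000. The sketch below uses hypothetical transcript amounts purely to show the calculation; they are not measured values from this Example.

```python
def relative_expression(gene_fg, actin_fg, scale=1000.0):
    """Express a transcript amount relative to beta-actin set to `scale`."""
    if actin_fg <= 0:
        raise ValueError("beta-actin amount must be positive")
    return scale * gene_fg / actin_fg


# Hypothetical amounts (fg) for one cell line, for illustration only.
value = relative_expression(gene_fg=2.0, actin_fg=400.0)
print(value)
```

Dividing by the β-actin amount measured in the same cDNA preparation compensates for differences in cDNA synthesis efficiency between samples.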

β3Gal-T1, β3Gal-T2 and β3Gal-T3 have the activity of
synthesizing Galβ1-3GlcNAc structure, but they were hardly
expressed in the colon cancer cell lines (Colo205, Colo201,
SW1116) and pancreatic cancer cell line (Capan-2) expressing
large amounts of type 1 sugar chains. On the other hand,
β3Gal-T4 was expressed in the colon cancer cell lines (Colo205,
Colo201, SW1116) and pancreatic cancer cell line (Capan-2),
but it has been revealed that this enzyme has no activity of
synthesizing Galβ1-3GlcNAc structure.

From these results, it was revealed that the β1,3-
galactosyltransferase involved in the synthesis of type 1 sugar
chains in these cell lines is a novel enzyme.

Example 2. Cloning of a gene (cDNA) coding for the novel
polypeptide having β1,3-galactosyltransferase activity,
participating in the synthesis of type 1 sugar chains such as
sialyl-Lewis a sugar chain in cancer cells in the digestive
system, such as colon cancer cells or pancreatic cancer cells
(1) Acquisition of mRNA from human colon cancer cell line
Colo205

About 30 µg of mRNA was obtained from human colon cancer
cell line Colo205 by use of a mRNA extraction kit,
Oligotex™-dT30 <Super> (a product of Roche).
The reagents and method employed are those described in
instructions attached to the kit.
(2) Preparation of a cDNA library derived from human colon
cancer cell line Colo205

Eight micrograms of the mRNA derived from human colon
cancer cell line Colo205, obtained in item (1) above, and a
kit from Gibco BRL (SUPERSCRIPT Choice System for cDNA
Synthesis) were used to synthesize double-stranded cDNA by use
of oligo-dT as a primer.

SfiI linkers were added to both termini of the
double-stranded cDNA in the following manner.

[Addition of SfiI linkers]
The single-stranded DNA represented by SEQ ID NO: 14 and
the single-stranded DNA represented by SEQ ID NO: 15 were
synthesized by 380A DNA synthesizer (a product of Applied
Biosystems).

Each of the synthesized single-stranded DNAs (50 µg) was
separately dissolved in 50 µl of a buffer containing 50 mmol/l
Tris-HCl buffer (pH 7.5), 10 mmol/l MgCl2, 5 mmol/l dithiothreitol
(abbreviated hereinafter to DTT), 0.1 mmol/l EDTA and 1 mmol/l
ATP (this buffer is referred to hereinafter as T4 kinase buffer),
followed by addition of 30 U of T4 polynucleotide kinase (a
product of Takara Shuzo Co., Ltd.) to allow phosphorylation
reaction at 37°C for 16 hours whereby 11-bp and 8-bp linkers
were obtained, respectively.

Four micrograms of the 11-bp linker, 2.9 µg of the 8-bp
linker and the double-stranded cDNAs synthesized above were
dissolved in 45 µl of T4 ligase buffer, and 1050 U of T4 DNA
ligase was added thereto followed by reaction at 16°C for 16
hours, and the SfiI linkers were added to each of the
double-stranded cDNAs.
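
The two linker masses differ because the linkers differ in length; in molar terms they are close to equal, as the following sketch suggests. It assumes an average mass of about 330 g/mol per nucleotide for single-stranded DNA, a common rule of thumb that is not stated in the text, and the function name is illustrative.

```python
# Assumption: about 330 g/mol per nucleotide for single-stranded DNA.
AVG_NT_MASS_G_PER_MOL = 330.0


def oligo_nmol(mass_ug, length_nt):
    """Approximate amount of an oligonucleotide in nmol."""
    grams = mass_ug * 1e-6
    moles = grams / (length_nt * AVG_NT_MASS_G_PER_MOL)
    return moles * 1e9


n_11mer = oligo_nmol(4.0, 11)   # 11-bp linker strand
n_8mer = oligo_nmol(2.9, 8)     # 8-bp linker strand

print(f"11-mer: {n_11mer:.2f} nmol, 8-mer: {n_8mer:.2f} nmol")
```

Both amounts come out near 1.1 nmol, so the two strands are supplied in roughly equimolar quantities, as would be expected if they anneal to form the linker.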

The resulting reaction mixture was subjected to agarose
gel electrophoresis and DNA fragments of about 1.5 kb or more
were recovered.

Twenty-four micrograms of expression cloning vector
pAMo [J. Biol. Chem., 268, 22782 (1993), also called pAMoPRC3Sc
(Japanese Published Unexamined Patent Application No.
336963/93)] was dissolved directly in 590 µl of a buffer
containing 10 mmol/l Tris-HCl (pH 7.5), 6 mmol/l MgCl2, 50
mmol/l NaCl and 6 mmol/l 2-mercaptoethanol (this buffer is
referred to hereinafter as Y-50 buffer), and 80 U of restriction
enzyme SfiI (a product of Takara Shuzo Co., Ltd.; all
restriction enzymes used hereinafter are products of Takara
Shuzo Co., Ltd. unless otherwise specified) was added to allow
digestion reaction at 37°C for 16 hours.

Forty units of BamHI was added to the reaction mixture,
followed by digestion reaction at 37°C for 2 hours. The
reaction mixture was subjected to agarose gel electrophoresis,
and about 8.8-kb DNA fragments were recovered.

The DNAs prepared above (each derived from 8 µg of mRNA)
having SfiI linkers added thereto were separately dissolved
in 250 µl of T4 ligase buffer, and to each mixture were added
2 µg of about 8.8-kb DNA fragments obtained above and 2000 U
of T4 DNA ligase, followed by ligation reaction at 16°C for
16 hours.

After the reaction, 5 µg of transfer RNA (tRNA) was added
to each reaction mixture, and the sample was precipitated with
ethanol and the obtained precipitate was dissolved in 20 µl
of a buffer containing 10 mmol/l Tris-HCl buffer (pH 8.0) and
1 mmol/l EDTA (sodium ethylenediamine tetraacetate) (this
buffer is referred to hereinafter as TE buffer).

The reaction solution was used to transform E. coli LE392
(Molecular Cloning, 2nd edition) by the electroporation method
[Nucleic Acids Res., 16, 6127 (1988)], and about 1,000,000
ampicillin-resistant transformants were obtained, to prepare
a cDNA library.

Using said cDNA library (E. coli) and a plasmid maxi kit
(Product No. 41031), that is, a plasmid preparation kit
produced by Qiagen, a plasmid containing the cDNA was prepared.
(3) Acquisition of a cDNA fragment for the novel β1,3-
galactosyltransferase by use of degenerate primers

By comparing the amino acid sequences of the four known
β1,3-galactosyltransferases (β3Gal-T1, β3Gal-T2, β3Gal-T3,
β3Gal-T4), three or more sites each having a well-conserved
amino acid sequence in the four known β1,3-
galactosyltransferases were found. Said three sites are
called motifs 1, 2 and 3, respectively, from the N-terminus.
The amino acid sequence of each motif in the four β1,3-
galactosyltransferases and the position of the first amino acid
of each motif from the N-terminus are shown in Table 2.
Table 2. Amino Acid Sequence Motifs Conserved in β3Gal-T Family

            Motif 1        Motif 2         Motif 3

β3Gal-T1    A[?] IRETWG    Y172 VMKTDSD    E264 DVYVGLC
β3Gal-T2    A169 IRQTWG    Y247 VMKTDSD    E340 DVYVGIC
β3Gal-T3    A96 IRVTWG     Y175 VMKTDTD    E266 DVYVGIC
β3Gal-T4    A89 IRASWG     Y170 VLKTDDD    E290 DFYVGVS
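
The degree of conservation within each motif can be made explicit by comparing the four sequences column by column. The sketch below is only an illustration of that comparison: the β3Gal-T1 entry for motif 1 is read here as AIRETWG, which is an assumption because that cell of Table 2 is poorly legible, and the function name is illustrative.

```python
MOTIFS = {
    # Order: b3Gal-T1, b3Gal-T2, b3Gal-T3, b3Gal-T4 (from Table 2).
    # The first motif 1 entry is an assumed reading of a poorly legible cell.
    "motif 1": ["AIRETWG", "AIRQTWG", "AIRVTWG", "AIRASWG"],
    "motif 2": ["YVMKTDSD", "YVMKTDSD", "YVMKTDTD", "YVLKTDDD"],
    "motif 3": ["EDVYVGLC", "EDVYVGIC", "EDVYVGIC", "EDFYVGVS"],
}


def consensus(seqs):
    """Keep residues identical in all sequences; mark the others with 'x'."""
    return "".join(col[0] if len(set(col)) == 1 else "x" for col in zip(*seqs))


for name, seqs in MOTIFS.items():
    print(name, consensus(seqs))
```

Positions marked x are the ones at which a degenerate primer has to accommodate more than one amino acid.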



According to known methods [Carl W. Dieffenbach,
Gabriela S. Dveksler, "PCR Primer: A Laboratory Manual", Cold
Spring Harbor Lab. (1995), The Protocol Series "cDNA Cloning"
edited by Jyunichiro Inoue & Kentaro Senba and published by
Yodosha (1996), Science, 241, 42 (1988)], degenerate primers
having a nucleotide sequence corresponding to the amino acid
sequence of each motif were designed. As the forward primer,
two synthetic DNAs (whose sequences are represented by SEQ ID
NOS: 16 and 17, respectively) corresponding to motifs 1 and
2 were synthesized. As the reverse primer, two synthetic DNAs
(whose sequences are represented by SEQ ID NOS: 18 and 19,
respectively) corresponding to motifs 2 and 3 were synthesized.
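
To give a sense of why degenerate primers are needed, the number of distinct nucleotide sequences that can encode a short peptide can be counted from the standard genetic code. This is a generic sketch and does not reproduce the actual primers of SEQ ID NOS: 16 to 19; the peptide strings are single motif sequences taken from Table 2, and the function name is illustrative.

```python
from math import prod

# Number of codons per amino acid in the standard genetic code.
CODON_COUNT = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4,
    "H": 2, "I": 3, "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6,
    "T": 4, "W": 1, "Y": 2, "V": 4,
}


def degeneracy(peptide):
    """Number of nucleotide sequences that can encode the peptide."""
    return prod(CODON_COUNT[residue] for residue in peptide)


print(degeneracy("AIRQTWG"))   # motif 1 of b3Gal-T2
print(degeneracy("YVMKTDSD"))  # motif 2 of b3Gal-T1 and b3Gal-T2
```

Residues with few codons, such as tryptophan and methionine, keep the degeneracy down, which is one reason conserved motifs containing them are attractive primer sites.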

PCR was conducted using the DNAs represented by SEQ ID
NOS: 16 and 18 as the primers and the cDNA library (plasmid)
prepared in item (2) above as the template, and the amplified
DNA was blunt-ended with a DNA polymerase Klenow fragment and
then subcloned into the EcoRV site of pBluescript SK (-) (a product
of Stratagene). In addition, PCR was conducted using the DNAs
represented by SEQ ID NOS: 17 and 19 as the primers and the
cDNA library (plasmid) prepared in item (2) above as the
template, and the amplified DNA was blunt-ended with a DNA
polymerase Klenow fragment and then subcloned into the EcoRV site
of pBluescript SK (-) (a product of Stratagene).
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One unit of DNA polymerase AmliTaq Gold™ (a product of 
Perkin Elmer) was added to 50 ^il of reaction solution [10 mmol/1 
Tris-HCl (pH 8.3), 50 mmol/1 KC1, 1.5 mmol/1 MgCl 2 , 0.2 mmol/1 
dNTP, 0.001 % (w/v) gelatin, 0.2 pmol/1 primers] containing 
the cDNA library (100 ng plasmid) prepared in item (2) above, 
and the mixture was subjected to PGR. 

After heating at 95°C for 11 minutes, PCR was conducted 
using 45-cycle reaction, each cycle consisting of reaction at 
95°C for 30 seconds, at 35°C for 1 minute and at 72°C for 2 
minutes . 

The nucleotide sequence of each subcloned, PCR-amplified
fragment was determined using a kit from EPICENTRE TECHNOLOGIES
(SequiTherm EXCEL II Long-Read DNA Sequencing kit-ALF: Catalog
No. SE8301A) and an ALF DNA sequencer (a product of Amersham
Pharmacia Biotech).

As a result of PCR using the DNAs represented by SEQ ID
NOS: 16 and 18 as the primers, one DNA fragment coding for an
amino acid sequence that is homologous to, but not identical
with, the amino acid sequences of the known
β1,3-galactosyltransferases was obtained.

The sequence of said DNA fragment excluding the primer
regions agreed with nucleotides 643 to 851 of the DNA
represented by SEQ ID NO: 2.

Further, one DNA fragment coding for an amino acid
sequence that is homologous to, but not identical with, the
amino acid sequences of the known β1,3-galactosyltransferases
was obtained by PCR using the DNAs represented by SEQ ID NOS:
17 and 19 as the primers.

The sequence of said DNA fragment excluding the primer
regions agreed with nucleotides 876 to 1124 of the DNA
represented by SEQ ID NO: 2.
(4) Acquisition of cDNA of the novel
β1,3-galactosyltransferase

The two PCR-amplified fragments obtained in item (3) were
mixed and labeled with 32P by a multi-prime DNA labeling system
(a product of Amersham) to prepare a probe.

5 × 10^5 clones in the cDNA library prepared in item (2)
above were subjected to colony hybridization with said probe.

In this hybridization, the filter was washed twice at
65°C for 10 minutes by shaking it in a buffer consisting of
2-fold conc. SSPE (pH 7.4) [1-fold conc. SSPE is composed of
180 mmol/l sodium chloride, 10 mmol/l sodium dihydrogen
phosphate and 1 mmol/l ethylenediaminetetraacetic acid (EDTA)]
and 0.1% SDS, then washed once at 65°C for 15 minutes by shaking
it in a buffer consisting of 1-fold conc. SSPE and 0.1% SDS,
and finally washed twice at 65°C for 10 minutes by shaking it
in a buffer consisting of 0.2-fold conc. SSPE and 0.1% SDS.
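
The wash series can be restated numerically. The short Python sketch below scales the stated 1-fold SSPE composition to the 2-fold, 1-fold and 0.2-fold buffers and sums the wash time; the names `SSPE_1X`, `sspe` and `WASHES` are illustrative only and the SDS component is left out.

```python
# Composition of 1-fold conc. SSPE as stated in the text (mmol/l).
SSPE_1X = {
    "sodium chloride": 180.0,
    "sodium dihydrogen phosphate": 10.0,
    "EDTA": 1.0,
}


def sspe(fold):
    """Component concentrations (mmol/l) of SSPE at the given fold."""
    return {name: conc * fold for name, conc in SSPE_1X.items()}


# Wash series in order: (fold conc. SSPE, number of washes, minutes each).
WASHES = [(2.0, 2, 10), (1.0, 1, 15), (0.2, 2, 10)]


def total_wash_minutes(washes=WASHES):
    """Total shaking time over the whole wash series, in minutes."""
    return sum(count * minutes for _, count, minutes in washes)
```

The decreasing salt concentration at a constant 65°C makes each successive wash more stringent.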

As a result of the colony hybridization, two independent 
plasmids hybridizing therewith were obtained. 
(5) Determination of the nucleotide sequence of the cDNA
inserted in plasmid pAMo-3GT5

The entire nucleotide sequence of the cDNA contained in
pAMo-3GT5, one of the plasmids obtained in item (4) above,
was determined by the following method.

By using primers specific to sequences in the pAMo vector,
the 5'- and 3'-terminal sequences of said cDNA were determined.

Synthetic DNAs specific to the sequences thus determined
were prepared, and by using these DNAs as primers, the
nucleotide sequence of a further region in said cDNA was
determined.

This operation was repeated until the entire nucleotide
sequence of said cDNA was determined.

For nucleotide sequencing, DNA sequencer model 4000L (a
product of LI-COR Ltd.) and a reaction kit (SequiTherm EXCEL
II™ Long-Read™ DNA-sequencing kit-Lc: a product of Air Brown)
or DNA sequencer 377 (a product of Perkin Elmer) and a reaction
kit (ABI Prism™ BigDye™ Terminator Cycle Sequencing Ready
Reaction kit: a product of Applied Biosystems) were used.

The entire nucleotide sequence (2775 bp) of the cDNA
contained in pAMo-3GT5 is represented by SEQ ID NO: 2.

Said cDNA coded for a polypeptide consisting of 310 amino
acids having a structure characteristic of a
glycosyltransferase.

Said polypeptide has 28 to 37% homology at the amino acid
level to the four human β1,3-galactosyltransferases (β3Gal-T1,
β3Gal-T2, β3Gal-T3, β3Gal-T4) cloned so far, so said
polypeptide was considered to be a novel
β1,3-galactosyltransferase. Homology analysis of said amino
acid sequence was conducted using the sequence analysis software
GENETYX-MAC 10.1 (a product of Software Kaihatsu Co., Ltd.).
Homology (%) was calculated by dividing the number of
identical amino acid residues by the number of amino acid
residues of β3Gal-T5.
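
The homology calculation described above can be written out as a minimal Python sketch. It assumes the two sequences have already been aligned (gaps shown as "-"); the alignment procedure of GENETYX-MAC is not reproduced, and the function name is hypothetical.

```python
def percent_homology(aligned_query, aligned_other, query_length):
    """Identical aligned residues divided by the query length, in percent.

    Both strings must come from the same pairwise alignment and have equal
    length; "-" marks a gap and never counts as a match.
    """
    if len(aligned_query) != len(aligned_other):
        raise ValueError("aligned sequences must have equal length")
    identical = sum(
        1 for a, b in zip(aligned_query, aligned_other) if a == b and a != "-"
    )
    return 100.0 * identical / query_length
```

With the 310-residue polypeptide as the denominator, the reported range of 28 to 37% corresponds to roughly 87 to 115 identical residues. As a toy check, `percent_homology("IRQTWG", "IRVTWG", 6)` compares the motif 1 sequences of β3Gal-T2 and β3Gal-T3 from Table 2, which share five of six residues.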

The amino acid sequence of said polypeptide is
represented by SEQ ID NO: 1.

Said polypeptide was considered to consist of a cytoplasmic
region consisting of the N-terminal seven amino acids, a highly
hydrophobic membrane-binding region consisting of 19 amino
acids, a stem region consisting of at least 4 amino acids, and
a large C-terminal region containing a catalytic domain. On
the basis of comparison of its homology in amino acid sequence
to other β1,3-galactosyltransferases and information on the
stem region and catalytic domain of other
β1,3-galactosyltransferases [Japanese Published Unexamined
Patent Application No. 181759/94], it is estimated that the stem
region consists of at least 4 amino acids. Accordingly, a
partial polypeptide containing the amino acid sequence of
residues 31 to 310 is considered to contain the catalytic domain.
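
The residue range 31 to 310 follows from adding up the region lengths given above (7 + 19 + 4 = 30). A small Python sketch of that arithmetic, with a hypothetical helper name and the stem taken at its stated minimum of 4 residues:

```python
def domain_boundaries(total_length, region_lengths):
    """Assign consecutive 1-based residue ranges to the named regions.

    Whatever remains after the listed regions is the C-terminal region.
    """
    boundaries = {}
    start = 1
    for name, length in region_lengths:
        boundaries[name] = (start, start + length - 1)
        start += length
    boundaries["C-terminal region"] = (start, total_length)
    return boundaries


regions = domain_boundaries(
    310,
    [("cytoplasmic", 7), ("membrane-binding", 19), ("stem (minimum)", 4)],
)
```

Because the stem length is only a lower bound, the start of the catalytic domain may lie further toward the C-terminus than residue 31.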

Hereinafter, said cDNA is referred to as human β3Gal-T5
cDNA, and the polypeptide encoded by said cDNA is referred to
as human β3Gal-T5.

Human β3Gal-T5 cDNA was excised from pAMo-3GT5 by cleavage
with HindIII and NotI and inserted between the HindIII and NotI
sites of pBluescript II SK(+), whereby pBS-3GT5 was constructed
(Fig. 2).

Escherichia coli MM294/pBS-3GT5 harboring pBS-3GT5 was
deposited as FERM BP-6645 on February 10, 1999 with the National
Institute of Bioscience and Human-Technology, Agency of
Industrial Science and Technology (1-3, Higashi 1-chome,
Tsukuba-shi, Ibaraki, 305-8566, Japan).

(6) Synthesis of type 1 sugar chains in human cultured cells
into which the human β3Gal-T5 expression plasmid was introduced

The control plasmid (pAMo) and the human β3Gal-T5 expression
plasmid (pAMo-3GT5) were each dissolved at a concentration of
1 µg/µl in TE buffer and introduced by the electroporation
method [Cytotechnology, 3, 133 (1990)] into Namalwa cells to
give transformed cells.

After introduction of 4 µg of plasmid per 1.6 × 10^6 cells,
the cells were suspended in 8 ml of RPMI1640 medium containing
10% fetal bovine serum [RPMI1640 medium (Nissui Pharmaceuticals,
Co.) containing 1/40 volume of 7.5% NaHCO3, 3% of 200 mmol/l
L-glutamine solution (GIBCO) and 0.5% penicillin-streptomycin
solution (5000 units/ml penicillin and 5000 µg/ml streptomycin,
a product of GIBCO)] and cultured at 37°C for 24 hours in a
CO2 incubator.

After culturing, G418 (a product of GIBCO) was added
thereto at a concentration of 0.8 mg/ml, and culturing was
continued for a further 14 days, whereby a stable transformant
was obtained. The transformant was subcultured in RPMI1640
containing 0.8 mg/ml G418.

The transformed cells were subjected to indirect
fluorescent antibody staining with anti-sialyl-Lewis c sugar
chain antibody (DU-PAN-2, a product of Kyowa Medex).

The indirect fluorescent antibody staining was
conducted according to the method described in item (1) in
Example 1. As a result, the cells having pAMo-3GT5 introduced
therein showed significantly increased reactivity with the
anti-sialyl-Lewis c sugar chain antibody (DU-PAN-2) compared
with the cells having pAMo introduced therein (Fig. 3, A).

Further, the control plasmid (pAMo) and the human β3Gal-T5
expression plasmid (pAMo-3GT5) were introduced in the same
manner as above into the colon cancer cell line HCT-15, which
expresses no type 1 sugar chain, to give stable transformed
cells. From the stable transformed cells, single clones
(HCT-3GT5L and HCT-3GT5H) were obtained by the limiting
dilution method. The amount of β3Gal-T5 transcript in
HCT-3GT5L was lower than the amount of β3Gal-T5 transcript in
HCT-3GT5H (see Example 4 and Table 3). Said single clones were
subcultured in RPMI1640 containing 0.8 mg/ml G418.

The single clone (HCT-3GT5H) thus obtained was subjected
to indirect fluorescent antibody staining with
anti-sialyl-Lewis a sugar chain antibody (19-9),
anti-sialyl-Lewis c sugar chain antibody (DU-PAN-2, a product
of Kyowa Medex), anti-Lewis a sugar chain antibody (7LE) or
anti-Lewis b sugar chain antibody (Neokokusai).

The indirect fluorescent antibody staining was
conducted according to the method described in item (1) in
Example 1. As a result, it was revealed that the cells having
pAMo-3GT5 introduced therein showed significantly higher
reactivity with all four antibodies than the cells having
pAMo introduced therein (Fig. 3, B).

These results indicated that β3Gal-T5 is capable of
synthesizing type 1 sugar chains (sialyl-Lewis a sugar chain,
sialyl-Lewis c sugar chain, Lewis a sugar chain and Lewis b
sugar chain) in the transformed cells.

Further, these results mean that by expressing β3Gal-T5
in cells, sugar chains containing type 1 sugar chains
(sialyl-Lewis a sugar chain, sialyl-Lewis c sugar chain, Lewis
a sugar chain and Lewis b sugar chain) as well as complex
carbohydrates containing said sugar chains can be synthesized
de novo.

From the foregoing, it is evident that by secretory
production of useful glycoproteins in host cells expressing
β3Gal-T5, type 1 sugar chains (sialyl-Lewis a sugar chain,
sialyl-Lewis c sugar chain, Lewis a sugar chain and Lewis b
sugar chain, etc.) can be added to the glycoproteins to be
produced and secreted.

(7) Quantification of human β3Gal-T5 gene transcripts in
various human cell lines

Human β3Gal-T5 gene transcripts were quantified
according to the method in item (2) in Example 1.

Each of the single-stranded cDNAs derived from various
cell lines, prepared in item (1) in Example 1, was used as the
template.

As the standard used for preparation of a calibration
curve, linear DNA was prepared by cleaving the plasmid
(pBS-3GT5) having human β3Gal-T5 cDNA integrated in
pBluescript II SK(+), constructed in item (5), with suitable
restriction enzymes to excise the cDNA insert.

As the internal control, linear DNA was prepared by
cleaving a plasmid (pBS-3GT5d), prepared as described below,
with suitable restriction enzymes to excise the cDNA insert.

pBS-3GT5d was prepared by deleting a 144-bp sequence between
the Eco81I and XcmI sites in human β3Gal-T5 cDNA in pBS-3GT5.

Quantification of the transcripts by RT-PCR was
conducted in the same manner as in item (2) in Example 1 by
use of β3Gal-T5-specific primers. The nucleotide sequences
of the β3Gal-T5-specific primers are shown in Table 1 and SEQ
ID NOS: 20 and 21.

With the β3Gal-T5-specific primers, the DNA fragment (554
bp) shown as the target in Table 1 can be amplified from β3Gal-T5
gene transcripts and from the standard. With the same primers,
the DNA fragment (410 bp) shown as the competitor in Table 1 can
be amplified from the internal control.
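
The two fragment sizes are consistent with the 144-bp deletion, provided that the deleted Eco81I-XcmI segment lies entirely between the two primer sites (an assumption, though one implied by the stated sizes). A minimal Python check with illustrative names:

```python
TARGET_BP = 554    # fragment amplified from the transcript and the standard
DELETION_BP = 144  # Eco81I-XcmI segment removed to make pBS-3GT5d


def competitor_length(target_bp, deletion_bp):
    """Expected amplicon size from the internal control (competitor)."""
    return target_bp - deletion_bp


COMPETITOR_BP = competitor_length(TARGET_BP, DELETION_BP)
```

The size difference is what allows the target and competitor products of the same primer pair to be told apart after amplification.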

After heating at 95°C for 11 minutes, PCR was carried
out for 42 cycles, each cycle consisting of reaction
at 95°C for 1 minute, at 65°C for 1 minute and at 72°C for 2
minutes.

The amounts of the β3Gal-T5 gene transcripts in various
cell lines are shown as values relative to the amount (= 1000)
of the β-actin transcript (Fig. 4, A).
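
The normalization to β-actin amounts to a simple ratio scaled to 1000. A minimal Python sketch follows; the function name is hypothetical, and both inputs are assumed to be in the same units.

```python
def relative_to_actin(target_amount, actin_amount, scale=1000.0):
    """Express a transcript amount relative to β-actin set to `scale`."""
    if actin_amount <= 0:
        raise ValueError("β-actin amount must be positive")
    return scale * target_amount / actin_amount
```

On this scale a value of 1 means one thousandth of the β-actin transcript level in the same sample.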

It was revealed that the β3Gal-T5 transcript is expressed
in the colon cancer cell lines (Colo205, Colo201, SW1116) and
the pancreatic cancer cell line (Capan-2), which express large
amounts of type 1 sugar chains. Further, expression of the
β3Gal-T5 transcript was well correlated with expression of
type 1 sugar chains (see Fig. 1).






Taking the above results in (5) and (6) into
consideration, it is concluded that β3Gal-T5 is a novel
β1,3-galactosyltransferase involved in the synthesis of type
1 sugar chains such as sialyl-Lewis a sugar chain and
sialyl-Lewis c sugar chain in cancer cells of the digestive
system, such as colon cancer cells or pancreatic cancer cells.
(8) Expression of CA19-9 antigen-containing proteins in
various human cell lines

Expression of sialyl-Lewis a sugar chain-containing
proteins in colon cancer cell lines (Colo205, Colo201, SW1116,
LS180, HT29, WiDr, HCT-15, SW480, SW620), pancreatic cancer
cell lines (Capan-1, Capan-2) and stomach cancer cell lines
(KATOIII, MKN45, MKN74) was examined by Western blotting
analysis using anti-sialyl-Lewis a antibody (19-9).

19-9 has been utilized to detect cancer-related sugar
chains in colon cancers and pancreatic cancers, and the
sialyl-Lewis a sugar chain antigen detected by 19-9 is called
CA19-9 antigen.

The respective cells (1 × 10^7 cells) were suspended in a
solution [20 mmol/l HEPES (pH 7.2), 2% Triton X-100] and
sonicated briefly to prepare a cell lysate.

The protein concentration in said cell lysate was
determined using a micro-BCA protein assay reagent kit (a
product of PIERCE), and 10 µg of protein was subjected to
SDS-polyacrylamide gel electrophoresis (SDS-PAGE).

After the electrophoresis, the protein on the gel was
transferred with a Transblot SD cell (a product of Bio-Rad) onto
an Immobilon PVDF membrane (a product of Millipore).

The membrane was blocked by treatment overnight at 4°C with
a blocking solution (PBS containing 5% skim milk).

After the blocking, said membrane was treated at room
temperature for 2 hours with 10 µg/ml anti-sialyl-Lewis a
antibody (19-9) diluted with the blocking solution.

After this treatment, said membrane was treated with ECL
Western blotting detection reagent (a product of Amersham)
to detect the proteins to which 19-9 was bound. The method
employed was that described in the instructions attached to the
kit.

The results are shown in Fig. 4, B. 

Expression of CA19-9-containing glycoproteins agreed
well with expression of the β3Gal-T5 transcript (see Fig. 4, A)
measured in item (7) above. On the other hand, expression of
the other GlcNAc β1,3-galactosyltransferase (β3Gal-T1,
β3Gal-T2, β3Gal-T3) transcripts was not correlated with
expression of CA19-9-containing glycoproteins. Further, the
other GlcNAc β1,3-galactosyltransferases (β3Gal-T1, β3Gal-T2,
β3Gal-T3) were not expressed in Colo205 or Colo201, which
highly express CA19-9-containing glycoproteins (see Fig. 1).

These results indicate that β3Gal-T5 is a
β1,3-galactosyltransferase involved in the synthesis of the
cancer-related antigen CA19-9 in colon cancers, pancreatic
cancers, etc., and also that β3Gal-T5 can use glycoproteins as
substrates.

Example 3. In vitro activity of human β3Gal-T5
The in vitro activity of human β3Gal-T5 encoded by the human
β3Gal-T5 cDNA obtained in Example 2 was examined in the
following manner.

For comparison with the activities of other known
β1,3-galactosyltransferases, expression plasmids (pAMo-3GT1,
pAMo-3GT2, pAMo-3GT3 and pAMo-3GT4) having the cDNAs of human
β3Gal-T1, human β3Gal-T2, human β3Gal-T3 and human β3Gal-T4
integrated therein, respectively, were constructed.

pAMo-3GT1 is identical with plasmid pAMoPRWM1 used for
constructing plasmid pUC119-WM1 (FERM BP-4011) described in
Japanese Published Unexamined Patent Application No.
181759/94.

The control plasmid (pAMo) or each of the five
β1,3-galactosyltransferase expression plasmids (pAMo-3GT1,
pAMo-3GT2, pAMo-3GT3, pAMo-3GT4 and pAMo-3GT5) was introduced
in the same manner as in Example 3 into Namalwa cells to give
the respective transformed cells.

In the same manner as in (2) in Example 1, total RNA was
extracted from the transformed cells, and the amounts of
transcripts of the five β1,3-galactosyltransferase genes were
measured by quantitative RT-PCR.

The results are shown in Table 3.
Table 3. β1,3-Galactosyltransferase activity toward agalactoLNnT
as a substrate and expression of β3Gal-T transcripts in
each kind of cells

                          Expression of β3Gal-T transcripts
                          (β3Gal-T/β-actin × 10^3)
Cells          Activity   β3Gal-T5  β3Gal-T1  β3Gal-T2  β3Gal-T3  β3Gal-T4
Namalwa-mock   <1         <0.01     <0.01     2.1       <0.01     1.4
Namalwa-3GT1   <1         <0.01     56        2.1       <0.01     1.4
Namalwa-3GT2   <1         <0.01     <0.01     38        <0.01     1.4
Namalwa-3GT3   <1         <0.01     <0.01     2.1       42        1.4
Namalwa-3GT4   <1         <0.01     <0.01     2.1       <0.01     43
Namalwa-3GT5   100        35        <0.01     2.1       <0.01     1.4
HCT-3GT5L      18         5         <0.01     <0.01     2.2       0.1
HCT-3GT5H      48         16        <0.01     <0.01     2.2       0.1
Colo205        40         3.5       <0.01     <0.01     0.05      4.6
SW1116         22         1.4       <0.01     0.3       0.6       3.2
HCT-15         <1         <0.01     <0.01     <0.01     2.2       0.1
Capan-2        23         1.2       <0.01     <0.01     1.9       2.3
MKN45          <1         <0.01     0.04      0.5       66        <0.01
PC-1           <1         <0.01     27        0.3       <0.01     0.2



It was confirmed that the amounts of transcripts of the
corresponding β1,3-galactosyltransferase genes are higher in
the cells having the respective expression plasmids introduced
therein than in the cells having only the vector introduced
therein.

Said transformed cells were suspended in a solution [20
mmol/l HEPES (pH 7.2), 2% Triton X-100] and sonicated briefly
to prepare a cell lysate solution.

The protein concentration of said cell lysate solution
was determined using a micro-BCA protein assay reagent kit (a
product of PIERCE).

Said cell lysate solution was used to measure
β1,3-galactosyltransferase activity.

Preparation of the pyridylaminated sugar chain
substrate and measurement of the activity were conducted
according to known methods [Japanese Published Unexamined
Patent Application Nos. 181759/94 and 823021/94, J. Biol. Chem.,
269, 14730-14737 (1994)].

Specifically, the activity was measured by identifying
a product by high performance liquid chromatography (HPLC)
after reaction at 37°C for 2 hours in 10 µl of assay solution
[14 mmol/l HEPES (pH 7.4), 75 µmol/l UDP-Gal (a product of
SIGMA), 11 nmol/l MnCl2, 0.01% Triton X-100, 25 µmol/l
pyridylaminated sugar chain substrate, and the above cell
lysate solution].
The substrate (agalactoLNnT, GlcNAcβ1-3Galβ1-4Glc)
was obtained by treating lacto-N-neotetraose
(Galβ1-4GlcNAcβ1-3Galβ1-4Glc, hereinafter abbreviated to
LNnT) fluorescence-labeled with aminopyridine with
β-galactosidase to remove the terminal galactose residue.

AgalactoLNnT was prepared by adding 100 mU of
β-galactosidase (a product of Seikagaku Kogyo Co., Ltd.) to
about 60 nmol of LNnT fluorescence-labeled with aminopyridine,
reacting them at 37°C for 16 hours and then heating the mixture
at 100°C for 5 minutes to inactivate the β-galactosidase.

As the standard, LNnT fluorescence-labeled with
aminopyridine or lacto-N-tetraose (Galβ1-3GlcNAcβ1-3Galβ1-4Glc,
hereinafter abbreviated to LNT) fluorescence-labeled
with aminopyridine was used. LNnT and LNT were purchased from
Oxford Glico Systems Ltd. Fluorescence-labeling of the
oligosaccharides was carried out in a usual manner [Agric. Biol.
Chem., 54, 2169 (1990)].

After reaction with an assay solution containing or
not containing UDP-Gal (sugar donor), the reaction solution
was analyzed by HPLC, and a peak appearing only with the assay
solution containing UDP-Gal was taken to be a product.

After the reaction, the assay solution was treated at
100°C for 3 minutes and centrifuged at 10,000 × g for 5 minutes
to give a supernatant, an aliquot of which was subjected to
HPLC. HPLC was conducted at an elution temperature of 25°C and
a flow rate of 1 ml/min using a TSK-gel ODS-80Ts column
(4.6 × 300 mm, a product of Tosoh Corporation) and 0.02 mol/l
ammonium acetate buffer (pH 4.0) as the eluent.

Detection of the product was conducted using a
fluorescence spectrophotometer FP-920 (a product of Nippon
Bunko Co., Ltd.) (excitation wavelength, 320 nm; emission
wavelength, 400 nm).

The product was identified by comparing its retention
time with that of the standard sugar chain.

Quantification of the product was conducted by comparing
its fluorescence intensity with that of aminopyridylated
lactose as the standard.

The activities of the respective
β1,3-galactosyltransferases, relative to the activity (= 100)
of human β3Gal-T5, are shown in Table 3.

Clear β1,3-galactosyltransferase activity (LNT
synthesis activity) was detected in the cells expressing human
β3Gal-T5, but no activity was detected in the cells expressing
the other β1,3-galactosyltransferases. Nor was any activity
detected in the cells having the control plasmid (pAMo)
introduced therein.

In the cells having the respective
β1,3-galactosyltransferase expression plasmids introduced
therein, the respective β1,3-galactosyltransferase transcripts
were expressed to a similar degree (Table 3), thus revealing
that the GlcNAc β1,3-galactosyltransferase activity (LNT
synthesis activity) of human β3Gal-T5 is stronger than those of
the other GlcNAc β1,3-galactosyltransferases (human β3Gal-T1,
human β3Gal-T2, human β3Gal-T3). On the other hand, human
β3Gal-T4 is known to have GalNAc β1,3-galactosyltransferase
activity, but not GlcNAc β1,3-galactosyltransferase activity.

From the results described above, it was proven that
human β3Gal-T5 is a GlcNAc β1,3-galactosyltransferase.
Further, the activity of human β3Gal-T5 is stronger than those
of the other GlcNAc β1,3-galactosyltransferases (human
β3Gal-T1, human β3Gal-T2, human β3Gal-T3), thus indicating that
human β3Gal-T5 is useful for synthesis of type 1 sugar chains
such as LNT.

To examine whether the amount of β3Gal-T5 transcript
expressed is correlated with GlcNAc β1,3-galactosyltransferase
activity (LNT synthesis activity), the HCT-15 cells (HCT-3GT5L
and HCT-3GT5H) into which the β3Gal-T5 expression plasmid
obtained in Example 2 had been introduced, the colon cancer
cell lines (Colo205, SW1116, HCT-15), the pancreatic cancer
cell line (Capan-2), the stomach cancer cell line (MKN45) and
the lung cancer cell line (PC-1) used in Example 1 were
examined for the amount of β3Gal-T5 transcript expressed
therein and for GlcNAc β1,3-galactosyltransferase activity
(LNT synthesis activity).

The amount of β3Gal-T5 transcript expressed in each kind
of cells was measured using the method described in Example
1 or in item (7) in Example 2. GlcNAc
β1,3-galactosyltransferase activity (LNT synthesis activity)
was measured according to the method described above.

The results are shown in Table 3.

As a result, it was revealed that the amount of β3Gal-T5
transcript expressed is correlated with GlcNAc
β1,3-galactosyltransferase activity (LNT synthesis activity).
For example, the amount of β3Gal-T5 transcript expressed in
HCT-3GT5H was about 3 times as high as that in HCT-3GT5L, and
the GlcNAc β1,3-galactosyltransferase activity (LNT synthesis
activity) in HCT-3GT5H was also about 3 times as high as that
in HCT-3GT5L.
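
The "about 3 times" comparison can be reproduced from the Table 3 values. The Python sketch below is illustrative only: it uses the rows of Table 3 that report numeric values for both activity and β3Gal-T5 transcript (entries given as "<1" or "<0.01" are left out), and the Pearson coefficient is an added illustration, not an analysis reported in the text.

```python
from math import sqrt

# (relative activity, β3Gal-T5 transcript per β-actin × 10^3) from Table 3.
TABLE3 = {
    "HCT-3GT5L": (18.0, 5.0),
    "HCT-3GT5H": (48.0, 16.0),
    "Colo205": (40.0, 3.5),
    "SW1116": (22.0, 1.4),
    "Capan-2": (23.0, 1.2),
}


def fold_change(high, low):
    """Ratio of the higher value to the lower value."""
    return high / low


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


activity_fold = fold_change(TABLE3["HCT-3GT5H"][0], TABLE3["HCT-3GT5L"][0])
transcript_fold = fold_change(TABLE3["HCT-3GT5H"][1], TABLE3["HCT-3GT5L"][1])
r = pearson([v[1] for v in TABLE3.values()], [v[0] for v in TABLE3.values()])
```

The two HCT-15 clones share the same cellular background, which makes their fold changes the most direct comparison; the endogenous lines differ in many other respects, so the coefficient over all five rows should be read loosely.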

On the other hand, PC-1 expressed a large amount of
β3Gal-T1, but GlcNAc β1,3-galactosyltransferase activity (LNT
synthesis activity) was not detected. Further, MKN45
expressed a large amount of β3Gal-T3, but GlcNAc
β1,3-galactosyltransferase activity (LNT synthesis activity)
was not detected.

This result indicates that the GlcNAc
β1,3-galactosyltransferase activity (LNT synthesis activity) of
β3Gal-T1 and β3Gal-T3 is lower than that of β3Gal-T5, and this
agrees with the result of the analysis using the Namalwa
cells described above.

Example 4. Expression of the β3Gal-T5 gene in various organs
The β3Gal-T5 transcripts in various human tissues (brain,
lung, esophagus, stomach (body), stomach (antrum), jejunum,
colon, liver, pancreas, spleen, kidney, adrenal, uterus,
peripheral lymphocytes) were quantified by RT-PCR in the same
manner as in item (7) in Example 2. The amount of the β3Gal-T5
gene transcript in each organ is shown as a value
relative to the amount (= 1000) of the β-actin transcript
(Fig. 5).

It was revealed that the β3Gal-T5 transcripts are
significantly expressed in the stomach (body), stomach
(antrum), jejunum, colon and pancreas. Further, the β3Gal-T5
transcripts were slightly expressed in the brain, esophagus,
kidney and uterus. On the other hand, the β3Gal-T5 transcripts
were not expressed in the lung, liver, spleen, adrenal or
peripheral lymphocytes.

Example 5. Structural analysis of the β3Gal-T5 chromosomal
gene

At present, a large number of human chromosomal genes
whose functions are not known are registered in databases.
Accordingly, by comparing the sequence of the human cDNA coding
for the polypeptide of the present invention with the sequences
of human chromosomal genes registered in a database, the human
chromosomal gene coding for the polypeptide of the present
invention can be identified and its structure revealed. If the
sequence of a chromosomal gene matching the sequence of said
cDNA has been registered, the promoter region and the exon and
intron structures of the chromosomal gene coding for the
polypeptide of the present invention can be determined by
comparing the sequence of said cDNA with the sequence of said
chromosomal gene.

Comparison between the nucleotide sequence of β3Gal-T5
cDNA (SEQ ID NO: 2) and DNA sequences registered in the Genome
Project Database revealed that a part of the sequence
of Registration No. AF064860 contains the sequence of the cDNA
of β3Gal-T5, and hence that the β3Gal-T5 chromosomal gene is
located on human chromosome 21q22.3.

For the purpose of revealing the promoter region in the
β3Gal-T5 chromosomal gene, the 5'-terminal region of β3Gal-T5
cDNA was obtained from Colo205 cells by the 5' RACE method.
The 5' RACE method employed was that described in the
instructions attached to the kit (5' RACE system, Version 2.0,
a product of GIBCO).

First, single-stranded cDNA was synthesized using
Colo205-derived mRNA (1 µg) as a template and two synthetic
DNAs having the sequences represented by SEQ ID NOS: 22 and
23 as primers. After the synthesis, terminal deoxynucleotidyl
transferase was used to add oligo-dC to the 3'-terminus of said
cDNA, and then PCR was conducted using the synthetic DNA having
a dG tail supplied with the kit as the forward primer and a
synthetic DNA having the sequence represented by SEQ ID NO: 24
as the reverse primer.

After heating at 97°C for 11 minutes, PCR was carried
out for 42 cycles, each cycle consisting of reaction
at 94°C for 1 minute, at 55°C for 1 minute and at 72°C for 2
minutes.

The amplified fragment was digested with HindIII and SpeI
and then subcloned between the HindIII and SpeI sites of
pBluescript SK(-). The five plasmids thus obtained were
sequenced, and as a result, it was revealed that the origin of
transcription of the β3Gal-T5 chromosomal gene in Colo205 cells
is the base at position 85153 in Registration No. AF064860
mentioned above.

Accordingly, it was revealed that the region upstream
of this origin of transcription is a promoter region
functioning at least in Colo205 cells.

The promoter region (including the transcription-regulating
region) of the β3Gal-T5 chromosomal gene is
estimated to be a 5-kb region (nucleotides 1 to 5000 in the
nucleotide sequence represented by SEQ ID NO: 3) upstream of
the origin of transcription.

A 1-kb region (nucleotides 4001 to 5000 in the nucleotide
sequence represented by SEQ ID NO: 3) upstream of the origin
of transcription was analyzed for the presence of consensus
sequences for transcription factor binding by
using the TFSEARCH (transcription factor search) program
(Akiyama, Y., http://www.rwcp.or.jp/lab/pdappl/papia.html).

No TATA box was present upstream of the origin of
transcription, but it is estimated that two CdxA sites, one
AP-1 site and one MZF-1 (myeloid zinc finger 1 protein) site
are present in a 150-bp region upstream of the origin of
transcription.
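
The coordinates of these upstream windows in SEQ ID NO: 3 follow from one subtraction each. The Python sketch below assumes that the origin of transcription corresponds to nucleotide 5001 of SEQ ID NO: 3, which is inferred from the stated promoter range of 1 to 5000; the 150-bp window coordinates are derived here and are not stated in the text, and the function name is hypothetical.

```python
ORIGIN = 5001  # assumed position of the origin of transcription in SEQ ID NO: 3


def upstream_window(origin_pos, length):
    """1-based inclusive range of the `length` bases just upstream of origin."""
    if length >= origin_pos:
        raise ValueError("window extends beyond the start of the sequence")
    return (origin_pos - length, origin_pos - 1)


promoter_5kb = upstream_window(ORIGIN, 5000)
searched_1kb = upstream_window(ORIGIN, 1000)
proximal_150bp = upstream_window(ORIGIN, 150)
```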

The promoter region can also be experimentally specified
by introducing a plasmid having a reporter gene ligated
downstream of said region into cells expressing β3Gal-T5 and
examining whether the reporter gene is expressed or not.

To further analyze the exon and intron regions in the
β3Gal-T5 chromosomal gene, the presence of isoforms of β3Gal-T5
cDNA was examined by the PCR method.

PCR was carried out using the Colo205 cell-derived
single-stranded cDNA prepared in Example 1 as a template and
synthetic DNAs having the sequences represented by SEQ ID NOS:
22 and 25 as primers.

After heating at 97°C for 11 minutes, PCR was carried
out for 42 cycles, each cycle consisting of reaction
at 94°C for 1 minute, at 55°C for 1 minute and at 72°C for 2
minutes.

The amplified fragment was digested with HindIII and then
subcloned into the HindIII site of pBluescript SK(-). The
nucleotide sequences of the plasmids thus obtained were
determined, and as a result, it was revealed that at least five
β3Gal-T5 cDNA isoforms are present in Colo205 cells (Fig. 6).

The proportion of the expression levels of the respective 
isoforms, as determined by comparing the amounts of the 
PCR-amplif ied fragments corresponding to the respective 
isoforms, was that isoform 1 is 50%, isoform 2 is 50%, and 
isoforms 3, 4 and 5 are 1% or less, respectively. 

The PCR-amplif ied fragment corresponding to each 
isoform was specified by the size of the amplified fragment 
and by treatment with restriction enzyme (Xbal or BsmI) . 

The above results revealed that the β3Gal-T5 chromosomal
gene comprises 4 exons and 3 introns. The sequence of the
promoter region (including the transcription-regulating region)
of the β3Gal-T5 chromosomal gene and the sequence of the
β3Gal-T5 chromosomal gene are represented by SEQ ID NO: 3. The
promoter region (including the transcription-regulating region)
corresponds to nucleotides 1 to 5000 of the nucleotide sequence
represented by SEQ ID NO: 3, and the β3Gal-T5 chromosomal gene
corresponds to nucleotides 5001 to 10562 of that sequence.
The positions of the exons and introns in the β3Gal-T5
chromosomal gene are shown in Table 4 by using the nucleotide
numbers in SEQ ID NO: 3. In Table 4, nucleotide sequence
indicated by capital letters is an exon region, and nucleotide
sequence indicated by small letters is an intron region.

The structure of the β3Gal-T5 chromosomal gene
(positions and sequences of the exon and intron regions), its
position on the chromosome, and the position and sequence of
the promoter region of the β3Gal-T5 chromosomal gene could be
specified for the first time by revealing the structure and
functions of the β3Gal-T5 cDNA according to the present
invention.

Table 4. Exon/intron junctions of the β3Gal-T5 chromosomal gene

Exon     Nucleotide No. in   Length   Sequence of splice   Sequence of splice
No.      SEQ ID NO: 3        (bp)     acceptor site        donor site

exon1    5001-5273           273                           CTGTCACGgtatttcc
exon1'   5001-5140           140                           CCAAGCAGgtttctgg
exon2    5459-5567           109      ctctctagAGAACCCT     GTTTGGAGgtagggct
exon3    7427-7586           160      tttcctagTGATTCCT     AGCAAAAAgtgagtta
exon4    8234-10562          2329     cctttcagATGGCTTT
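The entries of Table 4 can be cross-checked with a short Python sketch: each exon length should equal end minus start plus one, and, under the usual GT-AG splicing convention, the intron part (small letters) of each acceptor site should end in "ag" and that of each donor site should begin with "gt". The dictionary below transcribes the table; the exon2 acceptor sequence is read as ctctctagAGAACCCT, which is an assumption because that entry is partly garbled in the available text. The function names are hypothetical.

```python
# (start, end, acceptor site, donor site) per exon, numbered as in SEQ ID NO: 3.
# None marks a junction that Table 4 leaves empty.
EXONS = {
    "exon1": (5001, 5273, None, "CTGTCACGgtatttcc"),
    "exon1'": (5001, 5140, None, "CCAAGCAGgtttctgg"),
    "exon2": (5459, 5567, "ctctctagAGAACCCT", "GTTTGGAGgtagggct"),  # acceptor: assumed reading
    "exon3": (7427, 7586, "tttcctagTGATTCCT", "AGCAAAAAgtgagtta"),
    "exon4": (8234, 10562, "cctttcagATGGCTTT", None),
}


def exon_length(start: int, end: int) -> int:
    """Length in bp of an exon given inclusive 1-based coordinates."""
    return end - start + 1


def intron_part(junction: str) -> str:
    """Small-letter (intron) portion of a junction sequence."""
    return "".join(ch for ch in junction if ch.islower())


def follows_gt_ag(acceptor, donor) -> bool:
    """True if the listed junctions are consistent with the GT-AG convention."""
    acceptor_ok = acceptor is None or intron_part(acceptor).endswith("ag")
    donor_ok = donor is None or intron_part(donor).startswith("gt")
    return acceptor_ok and donor_ok


if __name__ == "__main__":
    for name, (start, end, acceptor, donor) in EXONS.items():
        print(name, exon_length(start, end), follows_gt_ag(acceptor, donor))
```

With the coordinates as transcribed, the computed lengths agree with the Length column (273, 140, 109, 160 and 2329 bp), and every listed junction is consistent with the GT-AG convention.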





Industrial Applicability 

According to the present invention, there can be provided
a novel polypeptide having β1,3-galactosyltransferase
activity involved in the synthesis of type 1 sugar chains in
cancer cells in the digestive system, such as colon cancer cells,
pancreatic cancer cells, etc., a process for producing said
polypeptide, a DNA coding for said polypeptide, a recombinant
vector comprising said DNA integrated therein, a transformant
carrying said recombinant vector, an antibody recognizing said
polypeptide, a quantification method and an immunostaining
method for the polypeptide of the present invention by use of
said antibody, a process for producing type 1 sugar
chain-containing sugar chains and complex carbohydrates containing
said sugar chains by use of said polypeptide, a process for
producing type 1 sugar chain-containing sugar chains and
complex carbohydrates containing said sugar chains by use of
a transformant carrying said recombinant vector, a method for
screening a substance altering expression of a gene coding for
said polypeptide, a method for screening a substance altering
activity of said polypeptide, a method for diagnosis of
diseases such as colon cancers, pancreatic cancers and stomach
cancers by use of said DNA or said antibody, and a method for
treating diseases such as colon cancers, pancreatic cancers
and stomach cancers by use of said DNA, said substance altering
expression of a gene coding for the polypeptide, or said
substance altering the activity of the polypeptide.

SEQUENCE LISTING FREE TEXT

SEQ ID NO: 4 - Description of Artificial Sequence: Synthetic DNA
SEQ ID NO: 5 - Description of Artificial Sequence: Synthetic DNA
SEQ ID NO: 6 - Description of Artificial Sequence: Synthetic DNA
SEQ ID NO: 7 - Description of Artificial Sequence: Synthetic DNA
SEQ ID NO: 8 - Description of Artificial Sequence: Synthetic DNA
SEQ ID NO: 9 - Description of Artificial Sequence: Synthetic DNA
SEQ ID NO: 10 - Description of Artificial Sequence: Synthetic DNA
SEQ ID NO: 11 - Description of Artificial Sequence: Synthetic DNA
SEQ ID NO: 12 - Description of Artificial Sequence: Synthetic DNA
SEQ ID NO: 13 - Description of Artificial Sequence: Synthetic DNA
SEQ ID NO: 14 - Description of Artificial Sequence: Synthetic DNA
SEQ ID NO: 15 - Description of Artificial Sequence: Synthetic DNA
SEQ ID NO: 16 - Description of Artificial Sequence: Synthetic DNA
SEQ ID NO: 17 - Description of Artificial Sequence: Synthetic DNA
SEQ ID NO: 18 - Description of Artificial Sequence: Synthetic DNA
SEQ ID NO: 19 - Description of Artificial Sequence: Synthetic DNA
SEQ ID NO: 20 - Description of Artificial Sequence: Synthetic DNA
SEQ ID NO: 21 - Description of Artificial Sequence: Synthetic DNA
SEQ ID NO: 22 - Description of Artificial Sequence: Synthetic DNA
SEQ ID NO: 23 - Description of Artificial Sequence: Synthetic DNA
SEQ ID NO: 24 - Description of Artificial Sequence: Synthetic DNA
SEQ ID NO: 25 - Description of Artificial Sequence: Synthetic DNA



CLAIMS 

1. A polypeptide having β1,3-galactosyltransferase
activity involved in the synthesis of sialyl-Lewis a sugar
chain, present in colon cancer cells expressing sialyl-Lewis
a sugar chain.

2. A polypeptide selected from the group consisting of: 

(a) a polypeptide consisting of the amino acid sequence 
represented by SEQ ID NO: 1, 

(b) a polypeptide containing the amino acid sequence of 
31 to 310 in the amino acid sequence represented by SEQ ID NO: 
1, and 

(c) a polypeptide consisting of an amino acid sequence
where in the amino acid sequence of the polypeptide (a) or (b),
one or more amino acids have been deleted, replaced or added
and having β1,3-galactosyltransferase activity capable of
synthesizing Galβ1-3GlcNAc structure.

3. A polypeptide according to claim 1 or 2 wherein the
β1,3-galactosyltransferase activity is the activity of
transferring galactose via β1,3-linkage to
N-acetylglucosamine residue present at the non-reducing
terminus of a sugar chain.

4. A polypeptide according to claim 1 or 2 wherein the
β1,3-galactosyltransferase activity is the activity of
transferring galactose via β1,3-linkage to
N-acetylglucosamine residue present at the non-reducing
terminus of GlcNAcβ1-3Galβ1-4Glc or to N-acetylglucosamine
monosaccharide.

5. A DNA selected from the group consisting of: 

(a) DNA coding for the polypeptide described in any one 
of claims 1 to 4, 

(b) DNA having the nucleotide sequence of 402 to 1331 
in the nucleotide sequence represented by SEQ ID NO: 2, 

(c) DNA having the nucleotide sequence of 492 to 1331
in the nucleotide sequence represented by SEQ ID NO: 2, and

(d) DNA hybridizing under stringent conditions with the
DNA described in any of (a) to (c) and coding for a polypeptide
having β1,3-galactosyltransferase activity capable of
synthesizing Galβ1-3GlcNAc structure.

6. A recombinant DNA prepared by integrating the DNA 
described in claim 5 into a vector. 

7. A recombinant DNA according to claim 6 which is plasmid
pAMo-3GT5 or plasmid pBS-3GT5 (FERM BP-6645).

8. A transformant harboring the DNA described in claim
5, the recombinant DNA in claim 6 or the recombinant DNA in
claim 7.

9. A transformant according to claim 8 which is a member 
selected from the group consisting of a microorganism, an 
animal cell, a plant cell, an insect cell, a non-human 
transgenic animal and a transgenic plant. 

10. A transformant according to claim 9 wherein the
microorganism is a microorganism belonging to the genus
Escherichia.

11. A transformant according to claim 9 wherein the 
animal cell is a member selected from the group consisting of 
a mouse myeloma cell, a rat myeloma cell, a mouse hybridoma 
cell, a CHO cell, a BHK cell, an African green monkey kidney 
cell, a Namalwa cell, a Namalwa KJM-1 cell, a human embryonic 
kidney cell and a human leukemia cell. 

12. A transformant according to claim 9 wherein the 
insect cell is a member selected from the group consisting of 
a Spodoptera frugiperda ovarian cell, a Trichoplusia ni ovarian 
cell and a silkworm ovarian cell. 

13. A process for producing the polypeptide described 
in any one of claims 1 to 4, which comprises culturing a 
transformant harboring a recombinant DNA prepared by 
integrating DNA coding for the polypeptide of any one of claims 
1 to 4 into a vector in a medium to thereby form and accumulate 
said polypeptide in the culture, and collecting said 
polypeptide from said culture. 

14. A process for producing the polypeptide described
in any one of claims 1 to 4, which comprises breeding a non-human
transgenic animal harboring a recombinant DNA prepared by
integrating DNA coding for the polypeptide of any one of claims
1 to 4 into a vector to thereby form and accumulate said
polypeptide in said animal, and collecting said polypeptide
from said animal.

15. A process according to claim 14 wherein formation 
and accumulation occur in animal milk. 

16. A process for producing the polypeptide described 
in any one of claims 1 to 4, which comprises culturing a 
transgenic plant harboring a recombinant DNA prepared by 
integrating DNA coding for the polypeptide of any one of claims 
1 to 4 into a vector to thereby form and accumulate said 
polypeptide in said plant, and collecting said polypeptide from 
said plant. 

17. A process for producing the polypeptide described
in any one of claims 1 to 4, which comprises synthesizing the
polypeptide of any one of claims 1 to 4 in an in vitro
transcription-translation system using DNA coding for said
polypeptide.

18. A process for producing a reaction product having 
galactose, which comprises using the polypeptide of any one 
of claims 1 to 4 as an enzyme source, and allowing 

(a) said enzyme source, 

(b) an acceptor substrate selected from the group 
consisting of: 

i) N-acetylglucosamine (GlcNAc),

ii) an oligosaccharide having N-acetylglucosamine
residue at the non-reducing terminus thereof, and

iii) a complex carbohydrate having N-acetylglucosamine
residue at the non-reducing terminus thereof, and

(c) uridine-5'-diphosphate galactose to be present in
an aqueous medium to thereby form and accumulate said reaction
product in the aqueous medium, and collecting said reaction
product from said aqueous medium, wherein the galactose is
transferred via β1,3-linkage to N-acetylglucosamine or
N-acetylglucosamine residue of said acceptor substrate.

19. A process for producing a reaction product having
galactose, which comprises using the polypeptide of any of
claims 1 to 4 as an enzyme source, and allowing

(a) said enzyme source,

(b) an acceptor substrate selected from the group
consisting of:

i) glucose,

ii) an oligosaccharide having glucose residue at the
non-reducing terminus thereof, and

iii) a complex carbohydrate having glucose residue at
the non-reducing terminus thereof, and

(c) uridine-5'-diphosphate galactose to be present in
an aqueous medium to thereby form and accumulate said reaction
product in the aqueous medium, and collecting said reaction
product from said aqueous medium, wherein the galactose is
transferred via β1,3-linkage to glucose or glucose residue of
said acceptor substrate.

20. A process for producing a sugar chain or a complex
carbohydrate, which comprises culturing the transformant
selected from the group consisting of transformants of claim
9 derived from a microorganism, an animal cell, a plant cell
and an insect cell in a medium to thereby form and accumulate
a sugar chain having galactose transferred via β1,3-linkage
to N-acetylglucosamine, N-acetylglucosamine residue, glucose
or glucose residue thereof or a complex carbohydrate containing
said sugar chain in the culture, and collecting said sugar chain
or said complex carbohydrate from said culture.

21. A process for producing a sugar chain or a complex
carbohydrate, which comprises breeding the non-human
transgenic animal of claim 9 to thereby form and accumulate
in said animal a sugar chain having galactose transferred via
β1,3-linkage to N-acetylglucosamine, N-acetylglucosamine
residue, glucose or glucose residue thereof or a complex
carbohydrate containing said sugar chain, and collecting said
sugar chain or said complex carbohydrate from said animal.

22. A process for producing a sugar chain or a complex
carbohydrate, which comprises culturing the transgenic plant
of claim 9 to thereby form and accumulate in said plant a sugar
chain having galactose transferred via β1,3-linkage to
N-acetylglucosamine, N-acetylglucosamine residue, glucose or
glucose residue thereof or a complex carbohydrate containing
said sugar chain, and collecting said sugar chain or said
complex carbohydrate from said plant.

23. A process according to any one of claims 18 to 22
wherein the complex carbohydrate is a complex carbohydrate
selected from the group consisting of a glycoprotein, a
glycolipid, a proteoglycan, a glycopeptide, a
lipopolysaccharide, a peptidoglycan and a glycoside which is
a steroid compound with a sugar chain.

24. A process according to claim 21 wherein formation 
and accumulation occur in animal milk. 

25. A method for determining the expression level of a
gene encoding the polypeptide of any one of claims 1 to 4, which
comprises hybridization using DNA coding for said polypeptide
or a fragment of said DNA.

26. A DNA selected from the group consisting of an
oligonucleotide having the same nucleotide sequence as a
consecutive 5- to 60-nucleotide sequence in the nucleotide
sequence of the DNA of claim 5 or of a DNA having the nucleotide
sequence represented by SEQ ID NO: 2 or 3, an oligonucleotide
having a sequence complementary to said oligonucleotide, and
an oligonucleotide derivative of any of said oligonucleotides.

27. A DNA according to claim 26 wherein the
oligonucleotide derivative is selected from the group
consisting of an oligonucleotide derivative in which the
phosphodiester bond is converted into a phosphorothioate bond,
an oligonucleotide derivative in which the phosphodiester bond
is converted into an N3'-P5'-phosphoamidate bond, an
oligonucleotide derivative in which the ribose and the
phosphodiester bond are converted into a peptide-nucleic acid
bond, an oligonucleotide derivative in which the uracil is
replaced by a C-5 propynyluracil, an oligonucleotide
derivative in which the uracil is replaced by a C-5
thiazolyluracil, an oligonucleotide derivative in which the
cytosine is replaced by a C-5 propynylcytosine, an
oligonucleotide derivative in which the cytosine is replaced
by a phenoxazine-modified cytosine, an oligonucleotide
derivative in which the ribose is replaced by a
2'-O-propylribose, and an oligonucleotide derivative in which the
ribose is replaced by a 2'-methoxyethoxyribose.

28. A DNA that has a nucleotide sequence represented by 
SEQ ID NO: 20 or 21. 

29. A method for determining the expression level of a
gene encoding the polypeptide of any one of claims 1 to 4, which
comprises polymerase chain reaction using the oligonucleotide
of any one of claims 26 to 28.

30. A method for detecting cancers and cancer metastasis,
which comprises using the method of claim 25 or 29.

31. A method for inhibiting transcription of DNA coding
for the polypeptide of any one of claims 1 to 4 or translation
of its corresponding mRNA, which comprises using a DNA selected
from DNAs of claims 5 and 26 to 28 and DNAs having a nucleotide
sequence represented by SEQ ID NO: 2 or 3.

32. An antibody recognizing the polypeptide of any one
of claims 1 to 4.

33. A method for immunological detection of the
polypeptide of any one of claims 1 to 4, which comprises using
the antibody of claim 32.

34. An immunohistostaining method, which comprises
detecting the polypeptide of any one of claims 1 to 4 by using
the antibody of claim 32.

35. An immunohistostaining agent comprising the
antibody of claim 32.

36. A diagnostic reagent for cancers or cancer metastasis,
which comprises the antibody of claim 32.

37. A method for screening a compound varying the
activity of the polypeptide of any one of claims 1 to 4, which
comprises contacting said polypeptide with a test sample.

38. A method for screening a compound varying the
expression of a gene coding for the polypeptide of any one of
claims 1 to 4, which comprises contacting cells expressing said
polypeptide with a test sample and determining the content of
sialyl-Lewis a sugar chain, Lewis a sugar chain, Lewis b sugar
chain or sialyl-Lewis c sugar chain by use of anti-sialyl-Lewis
a antibody, anti-Lewis a antibody, anti-Lewis b antibody
or anti-sialyl-Lewis c antibody.

39. A method for screening a compound varying the
expression of a gene coding for the polypeptide of any one of
claims 1 to 4, which comprises contacting cells expressing said
polypeptide with a test sample and determining the content of
said polypeptide by use of the antibody of claim 32.

40. A promoter DNA governing transcription of a gene
coding for the polypeptide described in any one of claims 1
to 4.

41. A promoter DNA according to claim 40, which works
in cells selected from the group consisting of small intestine
cells, large intestine cells, pancreas cells, stomach cells,
colon cancer cells, pancreatic cancer cells and stomach cancer
cells.

42. A promoter DNA according to claim 40 or 41, which 
is a human- or mouse-derived promoter DNA. 

43. A promoter DNA according to any one of claims 40 to
42, which comprises a 50- to 5000-bp consecutive nucleotide
DNA sequence in the nucleotide sequence of 1 to 5000 in the
nucleotide sequence represented by SEQ ID NO: 3.

44. A method for screening a compound varying the
efficiency of transcription by the promoter DNA of any one of
claims 40 to 43, which comprises transforming animal cells with
a plasmid containing said promoter DNA and a reporter gene
ligated downstream of said promoter DNA, then contacting the
transformant with a test sample, and determining the content
of a translation product of said reporter gene.

45. A screening method according to claim 44 wherein the
reporter gene is a gene selected from the group consisting of
a chloramphenicol acetyltransferase gene, a β-glucuronidase
gene, a β-galactosidase gene, a β-lactamase gene, a luciferase
gene, an aequorin gene and a green fluorescent protein gene.

46. A knockout non-human animal wherein a DNA coding for
the polypeptide of any one of claims 1 to 4 is rendered defective
or mutated.

47. A knockout non-human animal according to claim 46
wherein the knockout non-human animal is a mouse.

ABSTRACT

According to the present invention, there can be provided
a novel polypeptide having β1,3-galactosyltransferase
activity involved in the synthesis of type 1 sugar chains, a
DNA coding for said polypeptide, a recombinant vector
comprising said DNA, a transformant carrying said recombinant
vector, a process for producing type 1 sugar chain-containing
sugar chains and complex carbohydrates by use of said
polypeptide or said transformant, an antibody recognizing said
polypeptide, a method for detecting or quantifying said
polypeptide by use of said antibody, a method for screening
a substance correlated with said polypeptide, a method for
diagnosis or treatment of cancers in the digestive system by
use of said DNA or said antibody, and a method for treatment
of cancers in the digestive system by use of a substance
obtained by said screening method.



09/914152

Fig. 2

Fig. 3
Panels: Namalwa-mock, Namalwa-3GT5, HCT-mock, HCT-3GT5H
Labels: (sLe a), (sLe c), (Le a), (Le b)
Horizontal axis: log FITC

Fig. 4

COMBINED DECLARATION AND POWER OF ATTORNEY 
FOR PATENT COOPERATION TREATY APPLICATION 



As a below named inventor, I hereby declare that: 

My residence, post office address and citizenship are as stated below next to my name. 

I believe I am the original, first and sole inventor (if only one name is listed below) or an original, 
first and joint inventor (if plural names are listed below) of the subject matter which is claimed and for 
which a patent is sought on the invention entitled NOVEL POLYPEPTIDES 

the specification of which was filed as PCT International Application No. PCT/JP00/01070 on
24/02/00 and was amended under PCT Article 19 on (if applicable).

I hereby state that I have reviewed and understand the contents of the above identified 
specification, including the claims, as amended by any amendment referred to above. 

I acknowledge the duty to disclose information which is material to patentability as defined in
37 CFR § 1.56.

I hereby claim foreign priority benefits under Title 35, United States Code, § 119 of any foreign
application(s) for patent or inventor's certificate or of any PCT international application(s) designating at 
least one country other than the United States of America listed below and have also identified below any 
foreign application(s) for patent or inventor's certificate or any PCT international application(s) designating 
at least one country other than the United States of America filed by me on the same subject matter having 
a filing date before that of the application(s) on which priority is claimed: 

Country Application No. Filed (Day/Mo./Yr.) Priority Claimed (Yes/No) 

JP 47571/99 25/02/99 YES 



I hereby appoint the practitioners associated with the firm and Customer Number provided below 
to prosecute this application and to transact all business in the Patent and Trademark Office connected 
therewith, and direct that all correspondence be addressed to the address associated with that Customer 
Number: 



I hereby declare that all statements made herein of my own knowledge are true and that all 
statements made on information and belief are believed to be true; and further that these statements were 
made with the knowledge that willful false statements and the like so made are punishable by fine or 
imprisonment, or both, under Section 1001 of Title 18 of the United States Code and that such willful false
statements may jeopardize the validity of the application or any patent issued thereon. 



FITZPATRICK, CELLA, HA]

Full Name of Sole or First Inventor Hisashi Narimatsu

Inventor's signature

Date August 6, 2001 Citizen/Subject of

Residence Same as Post Office Address

Post Office Address 5-1082-1, Mukohara, Higashiyamato-shi, Tokyo 207-0013 Japan

Full Name of Second Joint Inventor, if any Soichiro Isshiki

Second Inventor's signature

Date August 10, 2001 Citizen/Subject of Japan

Residence Same as Post Office Address

Post Office Address 1-30-2-101, Chuo, Nakano-ku, Tokyo 164-0011 Japan

Full Name of Third Joint Inventor, if any Akira Togayachi

Third Inventor's signature

Date August 6, 2001 Citizen/Subject of Japan

Residence Same as Post Office Address

Post Office Address 1-11-42, Shimosato, Higashikurume-shi, Tokyo 203-0043 Japan

Full Name of Fourth Joint Inventor, if any Katsutoshi Sasaki

Fourth Inventor's signature

Date August 3, 2001 Citizen/Subject of Japan

Residence Sagamihara-shi, Kanagawa, Japan

Post Office Address c/o Tokyo Research Laboratories, Kyowa Hakko Kogyo Co., Ltd.
6-6, Asahi-machi 3-chome, Machida-shi, Tokyo 194-8533 Japan

Full Name of Fifth Joint Inventor, if any

SEQUENCE LISTING

<110> Hisashi NARIMATSU et al.

<120> NOVEL POLYPEPTIDES

<130> 11192US1

<140>
<141>

<150> JP 99/47571
<151> 2000-02-25

<160> 25

<170> PatentIn Ver. 2.0

<210> 1 
<211> 310 
<212> PRT 

<213> Homo sapiens 
<400> 1 

Met Ala Phe Pro Lys Met Arg Leu Met Tyr Ile Cys Leu Leu Val Leu
1 5 10 15

Gly Ala Leu Cys Leu Tyr Phe Ser Met Tyr Ser Leu Asn Pro Phe Lys
20 25 30

Glu Gln Ser Phe Val Tyr Lys Lys Asp Gly Asn Phe Leu Lys Leu Pro
35 40 45

Asp Thr Asp Cys Arg Gln Thr Pro Pro Phe Leu Val Leu Leu Val Thr
50 55 60

Ser Ser His Lys Gln Leu Ala Glu Arg Met Ala Ile Arg Gln Thr Trp
65 70 75 80

Gly Lys Glu Arg Met Val Lys Gly Lys Gln Leu Lys Thr Phe Phe Leu
85 90 95

Leu Gly Thr Thr Ser Ser Ala Ala Glu Thr Lys Glu Val Asp Gln Glu
100 105 110

Ser Gln Arg His Gly Asp Ile Ile Gln Lys Asp Phe Leu Asp Val Tyr
115 120 125

Tyr Asn Leu Thr Leu Lys Thr Met Met Gly Ile Glu Trp Val His Arg
130 135 140

Phe Cys Pro Gln Ala Ala Phe Val Met Lys Thr Asp Ser Asp Met Phe
145 150 155 160

Ile Asn Val Asp Tyr Leu Thr Glu Leu Leu Leu Lys Lys Asn Arg Thr
165 170 175

Thr Arg Phe Phe Thr Gly Phe Leu Lys Leu Asn Glu Phe Pro Ile Arg
180 185 190

Gln Pro Phe Ser Lys Trp Phe Val Ser Lys Ser Glu Tyr Pro Trp Asp
195 200 205

Arg Tyr Pro Pro Phe Cys Ser Gly Thr Gly Tyr Val Phe Ser Gly Asp
210 215 220

Val Ala Ser Gln Val Tyr Asn Val Ser Lys Ser Val Pro Tyr Ile Lys
225 230 235 240

Leu Glu Asp Val Phe Val Gly Leu Cys Leu Glu Arg Leu Asn Ile Arg
245 250 255

Leu Glu Glu Leu His Ser Gln Pro Thr Phe Phe Pro Gly Gly Leu Arg
260 265 270

Phe Ser Val Cys Leu Phe Arg Arg Ile Val Ala Cys His Phe Ile Lys
275 280 285

Pro Arg Thr Leu Leu Asp Tyr Trp Gln Ala Leu Glu Asn Ser Arg Gly
290 295 300

Glu Asp Cys Pro Pro Val
305 310
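The relationship between this 310-residue sequence and the nucleotide coordinates used elsewhere in the document (a coding region at positions 402 to 1331 of SEQ ID NO: 2, and a sub-region starting at 492 for residues 31 to 310) can be checked with a small Python sketch. The function names are hypothetical, and the sketch assumes an uninterrupted coding region whose stated end position excludes the stop codon.

```python
# Coding-region coordinates of SEQ ID NO: 2 as given in this document
# (1-based, inclusive; the stop codon is assumed to lie outside the range).
CDS_START = 402
CDS_END = 1331


def n_codons(start: int, end: int) -> int:
    """Number of codons in an inclusive coordinate range."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError("range length is not a multiple of 3")
    return length // 3


def residue_to_nt(residue: int, cds_start: int = CDS_START) -> int:
    """First nucleotide position of the codon for a 1-based residue number."""
    if residue < 1:
        raise ValueError("residue numbers start at 1")
    return cds_start + 3 * (residue - 1)


if __name__ == "__main__":
    print(n_codons(CDS_START, CDS_END))  # (1331 - 402 + 1) / 3 = 310
    print(residue_to_nt(31))             # 402 + 3 * 30 = 492
```

Both values agree with the document: 310 codons for the 310-residue polypeptide, and position 492 as the start of the region encoding residues 31 to 310.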



<210> 2 
<211> 2775 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 

<222> (402). . (1331) 
<400> 2 

gtgaattcct ctttctctgc tggagctggg atattcttct tctcctgccc ttggacatca 60

gagctgcagg ctctctggcc tttggacccg aggatttata ccaagcaggt ttctgggttc 120

tcaggccttt ggccttggac tgatagttac accattggca tatctggttc tgaggctctt 180

ggtcttggac tgagccacac tcctggcatc ccagcgtctc cagcttgcat ggcctgtcac 240

gtgattcctg tcagaatcac catttttggt aaacaaacca agcccagaac ctgataatta 300

tggagcattc tacactgaca gttctttgag acaaatttcc tcttggcatt tacactgtgg 360

ctttagcttt caaaccagag gttcctctta cccagcaaaa a atg gct ttc ccg aag 416
Met Ala Phe Pro Lys
1 5

atg aga ttg atg tat att tgc ctt ctg gtt ctg ggg gct ctt tgt ttg 464
Met Arg Leu Met Tyr Ile Cys Leu Leu Val Leu Gly Ala Leu Cys Leu
10 15 20

tat ttt agc atg tac agt cta aat cct ttc aaa gaa cag tcc ttt gtt 512
Tyr Phe Ser Met Tyr Ser Leu Asn Pro Phe Lys Glu Gln Ser Phe Val
25 30 35

tac aag aaa gac ggg aac ttc ctt aag ctc cca gat aca gac tgc agg 560
Tyr Lys Lys Asp Gly Asn Phe Leu Lys Leu Pro Asp Thr Asp Cys Arg
40 45 50

cag aca cct ccc ttc ctc gtc ctg ctg gtg acc tca tcc cac aaa cag 608
Gln Thr Pro Pro Phe Leu Val Leu Leu Val Thr Ser Ser His Lys Gln
55 60 65

ttg gct gag cgc atg gcc atc cgg cag acg tgg ggg aaa gag agg atg 656
Leu Ala Glu Arg Met Ala Ile Arg Gln Thr Trp Gly Lys Glu Arg Met
70 75 80 85

gtg aag gga aag cag ctg aag aca ttc ttc ctc ctg ggg acc acc agc 704
Val Lys Gly Lys Gln Leu Lys Thr Phe Phe Leu Leu Gly Thr Thr Ser
90 95 100

agt gca gcg gaa acg aaa gag gtg gac cag gag agc cag cga cac ggg 752
Ser Ala Ala Glu Thr Lys Glu Val Asp Gln Glu Ser Gln Arg His Gly
105 110 115

gac att atc cag aag gat ttc cta gac gtc tat tac aat ctg acc ctg 800
Asp Ile Ile Gln Lys Asp Phe Leu Asp Val Tyr Tyr Asn Leu Thr Leu
120 125 130

aag acc atg atg ggc ata gaa tgg gtc cat cgc ttt tgt cct cag gcg 848
Lys Thr Met Met Gly Ile Glu Trp Val His Arg Phe Cys Pro Gln Ala
135 140 145

gcg ttt gtg atg aaa aca gac tca gac atg ttc atc aat gtt gac tat 896
Ala Phe Val Met Lys Thr Asp Ser Asp Met Phe Ile Asn Val Asp Tyr
150 155 160 165

ctg act gaa ctg ctt ctg aag aaa aac aga aca acc agg ttt ttc act 944
Leu Thr Glu Leu Leu Leu Lys Lys Asn Arg Thr Thr Arg Phe Phe Thr
170 175 180

ggc ttc ttg aaa ctc aat gag ttt ccc atc agg cag cca ttc agc aag 992
Gly Phe Leu Lys Leu Asn Glu Phe Pro Ile Arg Gln Pro Phe Ser Lys
185 190 195

tgg ttt gtc agt aaa tct gaa tat ccg tgg gac agg tac cca cca ttc 1040
Trp Phe Val Ser Lys Ser Glu Tyr Pro Trp Asp Arg Tyr Pro Pro Phe
200 205 210

tgc tcc ggc acc ggc tac gtg ttt tct ggc gac gtg gcg agt cag gtg 1088
Cys Ser Gly Thr Gly Tyr Val Phe Ser Gly Asp Val Ala Ser Gln Val
215 220 225

tac aat gtc tcc aag agc gtc cca tac att aaa ctg gaa gac gtg ttt 1136
Tyr Asn Val Ser Lys Ser Val Pro Tyr Ile Lys Leu Glu Asp Val Phe
230 235 240 245

gtg ggg ctc tgc ctc gaa agg ctg aac atc aga ttg gag gag ctc cac 1184
Val Gly Leu Cys Leu Glu Arg Leu Asn Ile Arg Leu Glu Glu Leu His
250 255 260

tcc cag ccg acc ttt ttt cca ggg ggc tta cgc ttc tcc gta tgc ctc 1232
Ser Gln Pro Thr Phe Phe Pro Gly Gly Leu Arg Phe Ser Val Cys Leu
265 270 275

ttc agg agg atc gtg gcc tgc cac ttc atc aag cct cgg act ctc ttg 1280
Phe Arg Arg Ile Val Ala Cys His Phe Ile Lys Pro Arg Thr Leu Leu
280 285 290

gac tac tgg cag gct cta gag aat tcc cgg ggg gaa gat tgt ccg cct 1328
Asp Tyr Trp Gln Ala Leu Glu Asn Ser Arg Gly Glu Asp Cys Pro Pro
295 300 305

gtc tgaggggagc ccagaggcac atccggacaa gtttcagata acccgtgggg 1381
Val
310

atagtttttg ctagattttg gaagaggggg cgggacagag gatgctgttc ttcagtgctg 1441

aaatccacgc cagaatgtcg gtgttcatga agtcactgat tagttcccac ttggtgcccc 1501

aggcaataat aggcccgtct cttgggcacg cacactcttc atactaagtg tttgacatac 1561

acctggattt ttgcatttca ggggtcagta tcctatgaca tgatgggtgt taccatccta 1621

attttacagg caaggacaca gcagctgcga gaggtacaga aacttgtccc aaggctcaca 1681

gccagtaggc ataggagcgg gaatgaaaat cgagcactgt cagaatctgg tgggcagccc 1741

ctgacttgaa ccactcccac gtsctscctc cc11asrsass ggacactgat gatgaggtct 1801

cggagccggc atccttccat ccctgtcgag tcccctccac ctcagctccc agtccttgtg 1861

ctttttggag ctaagcctgg gatgaccaaa ttcaccccag ctccttcatt cacagggctg 1921

gatgtagctg ggattgagtc catgttatcg gctcggtact caacacaacc caagtttcat 1981

ccsassaaat gtccccgcag tggatgcagc tcacatgctg aggaacaccc agctctggac 2041

agagttctta taaatgtata aattaggctc agaaaccact gcattctgac ctgctgtaca 2101

gactgcccac actgctgacc tgcctagcga gcaggacatc ccttctgagc catctgctgc 2161

tctctcattt catcacccca actgtccctt gtttttgatc aatggggacc agccactgcc 2221

ccaggagcac 11ta2£2Ctc tcagttcaaa ctsaassaca gttgaactca gatggggttc 2281

at£?tfrpffa11 Ctfffffffl2Ttt trtffffffaa11 ra^ttffffrieft caagtcagga tgctctcaag 2341

gacccctcgg gctcagagcc ctaaastsss ccctggtgaa sfcassstfffft cctgcgtcca 2401

cttcccaagc ctcatcttca ttgaatgtct catttggccg aggaacaact 2461

gaactttgtg gtttgctgtt tagccttcag tttgctccgc tgcctcctac ccagaggttt 2521

gtgcgagcct gtgttgcagg gttgtataaa accaaggtac ttcgttagtt ttgcccattc 2581

agccatggtc acgtgacatg caaagtaatc ttgctcctaa ttatagaaat gatttttctt 2641

ttaatttttt actttaccag actttacttt gtactcagag aagaggcctc acatggctgt 2701

gtcacatata aatgttggac taaactctta aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2761

aaaaaaaaaa aaaa 2775

<210> 3
<211> 10562
<212> DNA
<213> Homo sapiens

<220>
<221> promoter
<222> (1).. (5000) 

<220> 

<221> exon 

<222> (5001).. (5140) 

<220> 

<221> exon 

<222> (5001).. (5273) 

<220> 

<221> exon 

<222> (5459).. (5567) 

<220> 

<221> exon 

<222> (7427).. (7586) 

<220> 

<221> exon 

<222> (8234).. (10562) 

<400> 3 



cgcctctggc 


aaggtagacc 


ttgaaggcaa 


aactgagttg 


aggtttgtta 


ggacggaaat 


60 


aat tactgct 


gggcatgcag 


cacttcccaa


ccgttctgtg 


aggcaggcag 


tgttattgcc 


120 


agtttggcac 


aagggcacag 


gtgtagaaca 


cgtaagtgcc 


ctgggccgtg ctacaccacc 


180 


actgtgtttg agctgagatg 


tgaaccaggg 


ccttctgatt 


ccaaattcct 


cattcctttc 


240 


atcctagcag 


gctgcctgcg 


gttagcagaa 


ggggactcct 


gtatctgctc 


tgcagcttct 


300 


tcagctgatt 


tataatggaa 


aacagagtag 


atattgattt 


ggcaattagt 


gaaatattat


360 


gagaatcatc 


atagcaaact 


tcacagtttg


atcaaggatc 


ctgccttcaa


tatctggcca 


420 


actgatgtgt 


aaaagcagct 


gcaagaactt


cagagctgac 


aaaaaaagca 


aactccagac 


480 


tttatttcct 


ggaatctgtt


ttgtgagaca 


ctggcccatg aaatgctctc 


ccagaaatag 


540 
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tcggatttgt 


ggtcaaataa 


atttgggcaa 


ttctacagaa 


catgtgtctt


tttcagagat 


600 


ttatttttaa 


ttaaacttat


ttaaaaatat


taacatggta 


caatttgcat


atagtgaagt 


660 


gtgcaaatct 


tcgctacatg 


gctcaatgag 


tttttacata 


tatttccacc 


catgtaatca 


720 


ccaccgagat 


ctagaataga 


atgtattcgt


ctctccagag 


gttcctctgt 


tcccttgcag 


780 


tcaataattc 


tccaccaaag 


ataaacattc 


tgctgacttc


cttcatcatt


gataagtttt


840 


tcctggtttt


gaacttcata


taaatggaat 


catttcatgt 


ctagctcttt 


tcactcaaca 


900 


taacatctgt 


cagattaata


catcgcatgt


caatagtttg 


tagtttttat 


ggctgtgtag 


960 


catttcattc 


attcattctg


ctgttgatgg 


aaatttcttg 


actagagttg 


tgttatcaat 


1020 


ttattgcctc 


ttggctccaa 


attcaccctt


tttgcctggt 


ctctggaaat 


gggtctgggc 


1080 


cctctaaata 


ttttcccttt 


gcaatctggc 


tcttgaagct 


tatcagtgga 


gggtcctgga 


1140 


ggggcattgc 


aggggaaaca 


gtttttcctt 


cctggttcag 


ggacgctggt 


ttggtagccc 


1200 


ctgtggtgtt 


acaggagtac 


ttggcaagac 


agcagtttcc 


ccgggtacac 


ccgctaggtg 


1260 


ttttgtagca 


gagtgcatct


gtgagacaac 


tggtgaactg 


ctttccttgc 


aacctagagg 


1320 


gcagatttct


ggcaagttcc 


agagggtgga 


ttccaaacat 


gttcctctaa 


tgtggatctg 


1380 


cagtgatgtc 


tctgccattc 


agtgggcata 


gctgtgccct 


tttagtgggg 


tctagatctc 


1440 


agccctggga 


ttgggggcat 


tttctcagtt 


gctcaatctc 


agccctaggg


gcagtgacca 


1500 


ttccttatat 


atggttgttt 


gtatattctt 


tggaactgtc 


ttgattttat 


tactgttagt 


1560 


ctctcattac 


tccactccct 


tattatagta 


aatgattctt 


tgtatttgac 


tttccctggt 


1620 


caaggtacta 


agtggttttt 


ctctcctgtt 


tggatctaga 


ttgatacaat 


aattttttgc 


1680 


aattatgagc 


aatgctgctg 


tgaacattct 


tgatcatgtc 


ttagtggaca 


taagcaataa 


1740 


ttgctgctgg 


gtctgtgtgc 


aggaatgatt 


tgctagatca


tagcacatac 


gtttatctgt 


1800 


agcagaaact 


gccaaggtat


atacagcttt


ccaaagtgtg 


ttaatttaca 


tgcctgctag


1860 


caaggtagga 


gaaatacagt 


tgcttcccaa 


gtagctggga 


ttacaggtgt 


gtgccaccac 


1920 


tcctggctaa 


ttttttgtag 


agatagggtt 


ttgccatgtt 


ggccaggcta


gtcttgaact 


1980 
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gctggcctca 


agtgatctgc 


ctgccttggg 


ctcccaaagt 


gctaggatta 


cagggatgag 


2040 


ccaccatggt 


cgacttcatg 


ataaaacttc 


agtggatgag 


gagctgcctc 


ttatgatgaa 


2100 


caaagaaggt 


ggtttcttga 


aatggaatct 


actcctggtg 


aagatgctgt 


gaacattgtt 


2160 


gaaatgacaa 


gaaagaattt 


acagtgttac 


atagagttag 


ttgatgaagc 


agtagcagga 


2220 


ttcgagagga 


tcgattccaa 


tttcaaaata 


agttcttctg 


tgggtaaaat 


gctatcaaat 


2280 


ggcgtcgcat 


gctacagaga 


aatctatcat 


gaaaggaaga 


gtcaattgat 


gtggcaaact 


2340 


tcattgttgt 


cgtattttaa 


gaaattgtca 


ggaccacccc 


aaccttcaac 


aaccatgacc 


2400 


ctgatcagtc 


aggagccatc


cacattgagg 


cgagaacctc 


cagcagtaaa 


aagattatga


2460 


ttctctaaag 


gatcagatga 


acattagcat 


ttttttaagc 


aataaagtat 


ttttacgtaa 


2520 


gatatgtatg 


ttatttttta 


ggcataatgc 


tattatgcat


ttaatagact 


ccagtatatt 


2580 


gtaaacataa 


ctttaaatgc


actgggagat 


aaaagtattt


gctcttttat 


gatatttgct


2640 


ttattgcagt 


agtctgtaat 


ggaaactaca 


ttatctcttg 


ggtacacctg 


tatacagaaa 


2700 


gaaatttatc


atgaggaaat 


gctcatgcaa


tgatggaggc 


tggaaagtcc 


caagatctgc 


2760 


agtcagcaag 


ttggagaccc 


atgagagtcg 


atggtgtggc 


cccagtctgt 


gtttgaaggc 


2820 


ctgagaacca 


ggagagccaa 


aagctggcaa 


gctctggacc 


caggaagagc 


tgatgtttca 


2880 


gttcaagtcc 


gaaggcagga 


aaagactgaa 


ggcccagctc


caggcagtca 


ggcaggagaa 


2940 


ctttcctttt 


actcacgaga 


gggtcagcat 


tttgttctgt 


tcaggctgtc 


aactgagtga 


3000 


cccagaaaag 


ctggcacata 


acattcacca 


tcgtgctgca


agagctgcaa


aaccctctct 


3060 


gcttctaaca


ctgatgctca


gcccacctcc 


agtgggcagg 


gagctgggtg 


ccgggaggac


3120 


ttggggttgc 


cagcccagtg 


tgggcctgga 


cagttgctga 


gaatctccct 


ccgccctgtg 


3180 


acttcttaat 


tacttagagg 


gtcaccctgg 


ttgctcactt


cagctcactt


gggagattct 


3240 


ctctgcttga


ggccaggggt 


aggtccaggt 


ctgatggggt 


ctgaagctta 


tacaattggg 


3300 


gtggggggca 


ttcctcttta


aggaaaagaa 


aacagcagag 


gtgagtcgtc 


ctgcagctta


3360 


gcttcactag


tctcgtggaa 


aatttgcctc


ctaaaatgtc 


tttcctctga 


gaaagcccag 


3420 



gcctccaaag 


gcccagccag 


gctggcgtca 


gtctggggtc 


agctgcgggg 


aggctccagg 


3480 


ccatgtgtga 


cacgggagtt 


taccccatcc 


cagtttccag 


tggagaagca 


tcgttctcgt 


3540 


ccacccccgt 


catgctcttt 


tccaccttct 


ccaggggaag 


ggatattttc 


agtctgtaca 


3600 


acgaatccac 


tgaaatgtta 


aggtgggcac 


agtggtctgg 


ggtcttcgac 


cttgtttata 


3660 


cgtggtgcct 


gttactggtc 


gggtctgtac 


aggcagtttc 


cactggcgtt 


ttttaccagg 


3720 


ccagcctaga 


gtagaatgac 


cgcatgttaa 


aatatagatt


cctgggcccc 


agacctgcca 


3780 


gaatctctgt 


gagatggaac 


ctttcaacct 


gggttttaaa 


caagcctccc 


gggtgattct 


3840 


gacactcact 


ggatttgaga 


accgtggggt 


tgttcagaca 


gcagggacgt 


tgatgttgtt 


3900 


ccttctgcgt 


tcctggtgat 


gctgttctgt 


tctcccaagg 


cctatgcggt 


ggtgagaatc 


3960 


tccaaggata 


caagacagtg 


atctggagcg 


agtgtcctga 


aagcagactc 


tagcactcag 


4020 


gactgccaac 


accctccccg 


ggtttccttg 


gtctggaatt 


cccatcccct 


ggttccacct


4080 


gttacatcac 


acctcccctt


caaggaccag 


tgcagatgcc 


acgtccttca


cggggctcag


4140 


aatgctcacc 


agcttcctct 


ccaccgaggg 


ccacagcccc 


tggagacccc 


ttgagctgag 


4200 


tgctttgtcc 


ttgcatactc 


tttctggcct 


catagtgggg 


cttggccatt


gtcccttcac 


4260 


tccagatctc 


tcctttcagg 


tccaggaagt 


gcatcttgaa 


cttaactttc 


cagacccccc 


4320 


cttcagtttt 


ccagtcctta 


gagaggtgga 


cttctgattc 


ctttgtctct 


gtgccctgta 


4380 


gcctcaggtc 


aggcttaagg 


caaggtctcc 


tcacctggcc 


tggggagagt 


cccaggacgc 


4440 


tgcacgtgcc 


tgtgcgggta 


ggatgctgat 


gcccagattt 


cccgttagag


agcctttccc 


4500 


tatcctgacg 


gctctagctt


tgtgtgttac 


ttacttgttc 


cactttaatt 


caaaatgtac 


4560 


ccagcaacca 


gcttgtgcac 


agttctctgg 


ggtttcagga 


gggatgtaag 


acatacccct 


4620 


tgcccttcag 


gcactatggc 


cagaaggggg 


gcagtgacct 


aggcagaggg 


cgggagccag


4680 


cagatgggat 


acactcagag 


gagcctgcag 


caggcagagg 


cagaggagaa 


gggaggtcta 


4740 


cacgttctgc 


actgtattta 


tctccttcag


ttccaaggtt


ctctcctggc 


atctatattg 


4800 


ctcatgagtt


acagagcaaa 


gcctggtgtg 


atggttactt 


ttaggtgtca 


acttggctgg 


4860 
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at taataaat 


acctagagaa 


ctggtaaagc 


attatttctg 


ggtgtgtttg 


tgaaggtgtt 


4920 


tccagaggag 


attggctgtg 


agtcagtggg 


ctgagtgggg 


aggagctgcc


ctccatgtgg 


4980 


gcaggcacca 


tccattgact 


gggcccagat


agaacaagaa 


ggcagaagaa 


atgtgaattc


5040 


ctctttctct


gctggagctg 


ggatattctt 


cttctcctgc 


ccttggacat 


cagagctgca


5100 


ggctctctgg 


cctttggacc


cgaggattta 


taccaagcag 


gtttctgggt 


tctcaggcct 


5160 


ttggccttgg 


actgatagtt 


acaccattgg 


catatctggt 


tctgaggctc 


ttggtcttgg 


5220 


actgagccac 


actcctggca 


tcccagcgtc 


tccagcttgc 


atggcctgtc 


acggtatttc


5280 


ccaacctccg 


taatcacgct


agccaattct


tctaagaaat 


ttcttctcat 


ctatctgtct 


5340 


gtctatctat 


ctatctgtct 


acctaccgac 


ttacctacct 


acctgcctat 


ctatcttttg 


5400 


attaatctac 


ctatcaatct 


ttctatctat


ccataacctg 


ttgattcgat 


ctctctagag 


5460 


aaccctgact 


aatacacctg 


gagtgcagaa 


tctgctggag


aaactgccat


tccgttattg 


5520 


actggctggt 


caggccatac 


agcctggtgg 


tctagatgtg 


tttggaggta 


gggcttctgt 


5580 


agcacagata 


gtgcctgttc 


atggctctgt 


cccaggtaag 


gcagagctag 


cttgtgctga 


5640 


gggcttctgc 


tttgcagctg 


gcctggggtg 


gctaggatct 


ggggacacag 


gctgcccttt 


5700 


ccaggctctg 


tctgctggtg 


ctgcaggtgc 


ccctacctcc 


tccttcagtg 


gaaggctggc 


5760 


ccccaggtcc 


tctttaggcc


caatacagac 


tcagccaaag


atgcagatgt 


ctcatatatg 


5820 


aggattctga 


gctgtgactt 


ctggtggtaa 


ctccacttta


ggcaggaaaa 


atgttcaact


5880 


gcccatgaaa


acaaatgacc 


ccgggtcatt


tgggtttggc 


acctgctctg 


ccagttgggt 


5940 


ttggcacctg 


ctctgcctcc 


tgggaaacag 


tttggccaat 


gcactgcatg


aggtgagcgc


6000 


ccatccctgg 


gaatttagag 


ccctgtgaag 


ggtcctgagg 


agaggcacat 


cagagagaat 


6060 


gagaatttaa 


ggtttactgt 


taaagcaacc 


catagaaaag 


gagcagaatt


attcaagcaa 


6120 


ggaaacaaag 


tagaaaaata 


tcttctttcc 


cttgcacttg


gtttttatgt 


ttctctctaa 


6180 


aatgtattgt


gggggagaaa 


gcagtccccc 


aaccccccta 


atcagctgca


tatcttagcc


6240 


atgcaaataa 


ataatgaaag 


agagaaagaa 


ggagagaaag 


agaaagaaaa 


gtaaaggaag 


6300 
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gaaggagaga 


aggaaggaaa 


gaaagaaaag 


aaagaaagaa 


actttgcagc


atcctggagc 


6360 


accagttcag 


acaagttctg 


gtctcctgct 


tgccttctgc 


tgtgatttct 


ctgaagttgc 


6420 


tgggggcagg 


agctgggcag 


gaactcccca 


ggggtgccaa 


gcagagcagg 


tagttggcta 


6480 


agtttgcctc


caggaaagaa 


gtccctggag 


agcgagctgg


ttctagaaag 


ctccattatt 


6540 


atattcctat 


tgcttttggc 


gaatatatgt 


agaacagaat 


tttgacaatg 


aaattttcag 


6600 


gtgctctttt 


ggccatcaaa 


ataaccagct 


cttggctggg 


cacagtggct 


tgcccttgta 


6660 


attccagtgt 


tttgggaggc 


caaggcaggg 


gactgcgtga 


gcccaggagt


ttgagactag 


6720 


cctgggcaac 


atagtgaaac 


cccatctcta 


caaaaaatac 


aaagttagaa 


gagtatggtg 


6780 


gcatgcgcct 


gtggtcacag 


ctacttggga 


agctgtgacc 


caagtcacag 


gaggatcgct


6840 


tgagcccagg 


agttcagggc 


tgtagtgagc 


tatgattgtg 


ctactgtgct 


ccagcctggg 


6900 


agacagagtg 


acatcaggtc 


tctgaaatat 


taaaattaaa 


aagcccaaac 


caactctgct 


6960 


tttcactctt 


tcagtttcat 


ttcttgctgt 


cctctctgtc 


ctctcaccca 


gggtaacatt


7020 


tttaaagtgc 


cgctattgtg 


ttaagaattg 


gatttattct 


ctgtgttaaa 


ttctctcagc 


7080 


attaactaca 


gactctgtta 


tgtaaataag 


gtaaattatc 


aggatgagaa 


gtgagactct 


7140 


aatttatgag 


tttatcatgt 


ctctttaaaa


agctgctagg


tgctatccta 


acttattagg 


7200 


cttgaaggat 


tctgggggat 


tggcatattg 


ttactgttgt


ggactttgtt


tgccttgatc


7260 


atacccattt 


tacagatgag 


aaaagtgagg 


ctgggattgg 


ggctcaaatg 


cgtgctcaga 


7320 


gtcacataag 


taggttggaa


ggtgacgcta 


cagacacggt


aaattgtgaa 


ggcctgctgg 


7380 


taaggcacga 


gtgatttgaa 


tgacactctt


tttttttttt 


tcctagtgat 


tcctgtcaga 


7440 


atcaccattt 


ttggtaaaca 


aaccaagccc 


agaacctgat 


aattatggag 


cattctacac


7500 


tgacagttct


ttgagacaaa


tttcctcttg 


gcatttacac 


tgtggcttta 


gctttcaaac 


7560 


cagaggttcc


tcttacccag 


caaaaagtga 


gttatacgct


ttcttaatgt 


tataacgtta


7620 


ccatggatga 


tcctgaactt


gccgaggata


gcagagacgg


gtgggcagaa 


caggaaagaa 


7680 


tcagatcaga 


gactgtaaaa 


agtaacttaa 


aaaaaaataa 


ttctggcaga 


gacagaattt 


7740 
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gaaggtactt 


gtgcacatca 


gaacactgga 


cttgcttttt


tctgggagca 


ggaatgctgc 


7800 


ttaattagat


cagagaagaa 


tgcaagtggt 


ccatacattt 


agatctacaa 


tgcgtggttt 


7860 


ccagacctgc 


agcttgtttt 


gctgcgcttc 


atcatggagt 


catagaaggg 


cagagctgga 


7920 


ggaccgagtg 


agggacctgg 


tgccatatcc 


ctacagacag 


gcaattggag


actcccgtag 


7980 


gttaagggct 


gcagagcctg 


gaccaatgcc 


cagaatctct 


gagcttttta 


tcttacacca 


8040 


tgaagtgaca 


gatgctggca 


gatgttagac 


ctttgtgctt 


aactgtttaa 


ccacacagca 


8100 


cccgacttct


gtatgcagcg 


aggttctaga


gtttccaaaa 


cacgggtctc 


ctctcccacc 


8160 


tcagcctcct 


agcataaaac 


tagacacatc 


ctcatgcttt


tgaggtctaa 


tcattggatt 


8220 


ttgttccttt 


cagatggctt 


tcccgaagat 


gagattgatg 


tatatttgcc


ttctggttct 


8280 


gggggctctt 


tgtttgtatt 


ttagcatgta 


cagtctaaat 


cctttcaaag 


aacagtcctt 


8340 


tgtttacaag 


aaagacggga 


acttccttaa 


gctcccagat 


acagactgca


ggcagacacc 


8400 


tcccttcctc 


gtcctgctgg 


tgacctcatc 


ccacaaacag 


ttggctgagc 


gcatggccat


8460 


ccggcagacg 


tgggggaaag 


agaggatggt 


gaagggaaag 


cagctgaaga 


cattcttcct 


8520 


cctggggacc 


accagcagtg 


cagcggaaac


gaaagaggtg 


gaccaggaga 


gccagcgaca 


8580 


cggggacatt 


atccagaagg 


atttcctaga


cgtctattac


aatctgaccc 


tgaagaccat 


8640 


gatgggcata 


gaatgggtcc 


atcgcttttg


tcctcaggcg 


gcgtttgtga 


tgaaaacaga 


8700 


ctcagacatg 


ttcatcaatg 


ttgactatct 


gactgaactg 


cttctgaaga 


aaaacagaac 


8760 


aaccaggttt 


ttcactggct 


tcttgaaact 


caatgagttt 


cccatcaggc 


agccattcag


8820 


caagtggttt 


gtcagtaaat 


ctgaatatcc 


gtgggacagg 


tacccaccat 


tctgctccgg 


8880 


caccggctac 


gtgttttctg 


gcgacgtggc 


gagtcaggtg 


tacaatgtct 


ccaagagcgt


8940 


cccatacatt 


aaactggaag 


acgtgtttgt 


ggggctctgc 


ctcgaaaggc 


tgaacatcag 


9000 


attggaggag 


ctccactccc 


agccgacctt 


ttttccaggg 


ggcttacgct


tctccgtatg


9060 


cctcttcagg 


aggatcgtgg


cctgccactt 


catcaagcct 


cggactctct 


tggactactg 


9120 


gcaggctcta 


gagaattccc 


ggggggaaga 


ttgtccgcct 


gtctgagggg 


agcccagagg


9180 
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cacatccgga 


caagtttcag 


ataacccgtg 


gggatagttt 


ttgctagatt 


ttggaagagg


9240


gggcgggaca 


gaggatgctg 


ttcttcagtg 


ctgaaatcca 


cgccagaatg 


tcggtgttca


9300


tgaagtcact 


gattagttcc 


cacttggtgc 


cccaggcaat 


aataggcccg 


tctcttgggc


9360


acgcacactc 


ttcatactaa gtgtttgaca 


tacacctgga 


tttttgcatt 


tcaggggtca 


9420


gtatcctatg acatgatggg tgttaccatc ctaattttac 


aggcaaggac 


acagcagctg 


9480


cgagaggtac 


agaaacttgt


cccaaggctc 


acagccagta 


ggcataggag 


cgggaatgaa


9540


aatcgagcac 


tgtcagaatc 


tggtgggcag 


cccctgactt 


gaaccactcc 


cacgtgctgc 


9600


ctcccttagg aggggacact 


gatgatgagg 


tctcggagcc 


ggcatccttc 


catccctgtc 


9660


gagtcccctc 


cacctcagct 


cccagtcctt gtgctttttg gagctaagcc 


tgggatgacc 


9720


aaattcaccc 


cagctccttc 


attcacaggg 


ctggatgtag ctgggattga gtccatgtta 


9780


tcggctcggt 


actcaacaca acccaagttt 


catccgagga 


aatgtccccg 


cagtggatgc 


9840


agctcacatg ctgaggaaca cccagctctg 


gacagagttc 


ttataaatgt 


ataaattagg 


9900


ctcagaaacc 


actgcattct 


gacctgctgt 


acagactgcc 


cacactgctg acctgcctag 


9960


cgagcaggac 


atcccttctg agccatctgc 


tgctctctca 


tttcatcacc 


ccaactgtcc 


10020 


cttgtttttg 


atcaatgggg 


accagccact 


gccccaggag 


cactttaggg ctctcagttc 


10080 


aaactgaagg 


acagttgaac


tcagatgggg 


ttcatgtggg 


attctgggag 


ctttctggga 


10140 


attcagttgg 


agtcaagtca 


ggatgctctc 


aaggacccct 


cgggctcaga 


gccctaaagt


10200


gggccctggt 


gaagcagggt 


ggtcctgcgt 


ccacttccca 


agcctgagcc 


aagctcatct 


10260


tcattgaatg 


tctcatttgg 


ccgaggaaca 


actgaacttt 


gtggtttgct 


gtttagcctt


10320


cagtttgctc 


cgctgcctcc 


tacccagagg 


tttgtgcgag cctgtgttgc 


agggttgtat 


10380


aaaaccaagg 


tacttcgtta 


gttttgccca 


ttcagccatg 


gtcacgtgac 


atgcaaagta 


10440 


atcttgctcc 


taattataga 


aatgattttt 


cttttaattt


tttactttac 


cagactttac


10500 


tttgtactca gagaagaggc 


ctcacatggc 


tgtgtcacat 


ataaatgttg 


gactaaactc 


10560 


tt 
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<210> 4 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 4 

ttcagccacc taacagttgc cagg 



<210> 5 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 5 

ataccttctt cgtggcttgg tggag 



<210> 6 
<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 6 

tagaagctag aagagctatt cggc 



<210> 7 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 7 

actcgccagt gattgaacac aaac 
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<210> 8 
<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 8 

cccaatgcca agtacgtaat gaag 



<210> 9 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 9 

tgtggtgttc cttagcatga cctg 



<210> 10 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 10 

ttgatcccca accaggaagc ttgc 



<210> 11 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 11 

tgaggccact gctcctctga tacg 
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<210> 12 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 12 

gatatcgccg cgctcgtcgt cgac 



<210> 13 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 13 

caggaaggaa ggctggaaga gtgc 



<210> 14 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 

<400> 14 
ctttagagca c 



<210> 15 
<211> 8 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 

<400> 15 
ctctaaag 



<210> 16 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> modified_base
<222> (3) 
<223> i 

<220> 

<221> modified_base
<222> (9) 
<223> i 

<220> 

<221> modified_base
<222> (15) 
<223> i 

<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 16 

gcnathmgnc aracntgggg 20 

<210> 17 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> modified_base
<222> (6) 
<223> i 

<220> 

<221> modified_base
<222> (15) 
<223> i 

<220> 

<221> modified_base
<222> (21) 
<223> i 
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<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 17 

taygtnatga aracngaytc ngay 



<210> 18 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> modified_base
<222> (10) 
<223> i 

<220> 

<221> modified_base
<222> (19) 
<223> i 

<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 18 

rtcrctrtcn gtyttcatna crta 



<210> 19 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> modified_base
<222> (4) 
<223> i 

<220> 

<221> modified_base

<222> (7) 

<223> i 

<220> 

<221> modified_base
<222> (10) 
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<223> i 
<220> 

<221> modified_base
<222> (16) 
<223> i 

<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 19 

rcanarnccn acrtanacrt cytc 



<210> 20 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 20 

accaccagca gtgcagcgga aac 



<210> 21 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 21 

gccacgatcc tcctgaagag gca 



<210> 22 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 22 

ctaagcttga aaggatttag actgtacatg c 



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<210> 23 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 23 

ctaagcttgt ctgcctgcag tctgtatctg g 



<210> 24 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 24 

ctaagcttga caggccatgc aagctggag 



<210> 25 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic DNA 
<400> 25 

ctaagcttgt gaattcctct ttctctgctg 



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